This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



THIS PAGE BLANK (uspto) 




Patent Office 
Canberra 



Quoo/ooU>M 

I, KAY WARD, TEAM LEADER EXAMINATION SUPPORT AND SALES 
hereby certify that annexed is a true copy of the Provisional specification in 
connection with Application No. PQ1675 for a patent by ST. VINCENT'S 
INSTITUTE OF MEDICAL RESEARCH filed on 19 July 1999. 




WITNESS my hand this 
Twenty-sixth day of July 2000 



KAY WARD 

TEAM LEADER EXAMINATION 
SUPPORT AND SALES 



PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 




AUSTRALIA 
Patents Act 1QQ0 



PROVISIONAL SPECIFICATION 



Applicant (s) : 

ST. VINCENT'S INSTITUTE OF MEDICAL RESEARCH 

Invention Title: 

INHIBITOR OF OSTEOCLAST PRECURSOR FORMATION 



The invention is describe -; r> ^ Q ^ n 

uescriiDed in the following statement: 




- 2 - 



INHIBITOR OF OSTEOCLAST PRECURSOR FORMATION 

This invention relates to a polypeptide factor 
which is able to inhibit the formation of osteoclasts. In 
5 particular, the invention relates to a factor which 

inhibits the differentiation of haematopoietic precursor 
cells into mononucleate osteoclast precursors. In a 
preferred form of the invention, the factor is a type II 
membrane polypeptide expressed on the osteoblast cell 
10 surface, which we have designated osteoclast inhibitory 
lectin (OCIL) . 

BACKGROUND OF THE INVENTION 

In normal adults, the processes of bone formation 
15 and resorption are balanced in order to maintain a normal 
healthy bone mass. With the onset of the menopause in 
females and with ageing in both sexes, the rate of bone 
resorption exceeds that of bone formation, resulting in net 
bone loss, and ultimately in osteoporosis. 
20 Osteoblasts are the bone cells responsible for 

bone formation, while osteoclasts are responsible for 
resorption of bone. Our understanding of the factors that 
regulate the formation and function of osteoclasts has been 
greatly enhanced by laboratory methods that have enabled us 
25 to isolate and grow these cells in culture. It is now well 
established that the development of active osteoclasts in 
vitro requires intimate contact between osteoblastic 
stromal cells and precursors of osteoclasts which are 
derived from haematopoietic cells belonging to the 
30 monocyte/macrophage lineage (Takahashi et al, 1988) . This 
process is influenced by a variety of factors, including 
1, 2 5-dihydroxyvitamin D3 , parathyroid hormone, 
prostaglandin E2 , and interleukins 6, 11 and 17, all of 
which enhance osteoclast formation. In contrast, cytokines 
35 such as interleukins 4, 10, 13 and 18 are inhibitory (Suda 
et al, 1995; Martin and Udagawa, 1998) . 
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All factors which stimulate osteoclast formation 
act directly on the osteoblast population and not on the 
osteoclast precursors, leading to the proposal that 
osteoblasts or stromal cells express a membrane-associated 
peptide that regulates the formation of functional 
multinucleate osteoclasts. a factor, termed "Osteoclast 
Differentiation Factor" (ODF) , that fulfils the functions 
of such a putative membrane-associated peptide has recently 
been cloned. ODF encodes a 316 amino acid type II 
transmembrane protein, and is a member of the TNF ligand 
family (Yasuda et al, 1998) . Recombinant protein 
corresponding to the extracellular domain of ODF stimulates 
the formation of active, bone-resorbing osteoclasts from 
haematopoietic cells within the spleen, even in the absence 
15 of stromal cells. A peptide identical to ODF has also been 
cloned from T cells, and designated Tumour Necrosis Factor- 
related activation-induced cytokine (TRANCE) , or receptor 
activator of NF-KB ligand (RANKL) (Wong et al, 1997). when 
released by T cells following activation of the T cell 
20 receptor, it mediates the interaction of T cells and 
dendritic cells, resulting in stimulation as well as. 
increased survival of the naive T cells. RANK, another 
member of the TNF-receptor family, has been identified on 
dendritic cells, and acts as the receptor for TRANC E / RANKL 
25 (Anderson et aJ, 1997) . 

Osteoprotegerin (OPG) is a soluble factor that 
belongs to the Tumour Necrosis Factor (TNF) receptor 
family. This factor is also known as Osteoclastogenesis 
inhibitory Factor (OCIF) . it has been shown to bind to 
ODF, resulting in the inhibition of formation of functional 
multinucleate osteoclasts in vitro. OPG is a 401 amino 
acid polypeptide. Overexpression of OPG in transgenic mice 
results in severe osteopetrosis, with a loss of bone marrow 
cav ltl es and profound depletion of osteoclasts. The same 
effects were observed upon administration of OPG to normal 
mxce. Furthermore, OPG blocked ovariectomy-associated bone 
loss in rat. OPG mRNA transcripts have been identified 
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within bone and cartilage, vascular structures, midgut and 
kidney, and in several osteoblast cell lines. Current data 
suggest that OPG blocks the terminal stages of osteoclast 
differentiation, but not the formation of mononuclear 
osteoclast precursors (Simonet et aJ, 1997; Tsuda et al , 
1997) . 

The interaction between ODF and OPG in the 
formation of osteoclasts is illustrated in Figure 1. 
Osteoclasts are derived from haematopoietic stem cells that 
differentiate along the monocyte/macrophage lineage. 
Mononuclear precursors of osteoclasts are required to come 
into direct or close contact with osteoblasts to be 
rendered capable of differentiating into mature, 
functional, multinucleate osteoclasts. Osteoblasts express 
15 ODF, a membrane -bound protein that stimulates the 

differentiation and formation of multinucleate osteoclasts 
from mononuclear precursors when it binds to its receptor, 
RANK. ODF expression is stimulated by bone-resorbing 
factors such as PTH, PGE 2 , 1 , 25-dihydroxyvitamin D 3 and 
20 interleukins 6 and 11. The action of ODF is antagonised by 
Osteoprotegerin, a soluble factor secreted by osteoblastic 
stromal cells. It binds to ODF to inhibit the formation of 
differentiated multinucleate osteoclasts, but does not 
prevent the formation of mononuclear osteoclast precursors. 

We have now identified a polypeptide factor which 
is able to inhibit formation of mononuclear osteoclast 
precursors from haematopoietic stem cells, and which is 
expressed at least on the cell membranes of osteoblasts. 
It appears that when the molecule is expressed on the 
osteoblast cell membrane it is not secreted. Preventing 
expression of the factor results in increased formation of 
mononuclear precursors of osteoclasts. 

SUMMARY OF THE INVENTION 
35 In a first aspect, the invention provides a 

nucleic acid molecule which comprises a sequence encoding a 
protein which 
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a) is expressed at least on osteoblasts, and 

b) mhibits osteoclast differentiation from 
haematopoietic cell precursors, 

5 undo, , ° r WhlCh hybrid±Ses to said nucleic acid molecule 
are :e rr ent COnditi ° nS - tringent conditions 

Sambrook et al (1989), and Example 2 herein. 

The nucleic acid may be cDNA, genomic DNA or 

10 Tr^" Preferab1 ^ the nucleic acid molecule is a 

10 CDNA. More preferably the cDNA comprises a sequence 

selected from the group consisting of SEQ ID NO- 2 SEO ID 
NO: 4 SEQ ID NO: 7 , SEQ ID NO: 3, SEQ ID NO: S. SEQ ID 
NO: 10 and SEQ ID NO: 11. 

15 of hfl . Preferab1 ^ ^e protein inhibits differentiation 
of hae matopoietic stem cells to osteoclast progenitor 
cells. m a particularly preferred embodiment, the nucleic 
3Cld m ° leCUle ° f the invention comprises a 110 base pa 
sequence as set out in SEQ ID NO: 2. 

20 anti „ Th±S aSPSCt ° f thS inve ^ion also encompasses 
anti-sense sequences directed against the nucleic acid 
no ecule defined above, and particularly 
antx-sense sequence directed against SEQ id NO- 10 
lTN0 e : a 2 b 5 y ^ anti " SenSe SS ™ is SEQ ID NO: 24 or SEQ 

In 3 SeC ° nd as P e ct, the invention provides a 
Polypeptide encoded by the nucleic acid molecule of the 
invention. Preferably the polypeptide is encoded by the 
human cDNA sequence. More preferably the polypeptide 
3Q comprises an amino acid sequence encoded by SEQ ID NO: 20 

antihoH „ 9 thlrd aSP6Ct ' ^ inventi on Provides an 
ant body directed against a polypeptide of the invention 
Preferably the antibody is directed against an epitope 
present in the sequence epitope 
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The antibody may be polyclonal or monoclonal, but is 
preferably monoclonal. Suitable methods for generating 
either polyclonal or monoclonal antibodies are very well 
known in the art. It will be clearly understood that the 
5 invention encompasses biologically-active fragments and 

analogues of such antibodies, including but not limited to 
ScFv fragments, trimeric antibodies, humanised antibodies 
and the like. Again, methods for producing such active 
fragments and analogues are well known in the art. See for 
10 example PCT/AU93 / 0 0491 and PCT/AU97/00212 and references 
cited therein. 

In a fourth aspect, the invention provides a 
composition comprising a polypeptide or an antibody of the 
invention, together with a pharmaceutically-acceptable 
15 carrier. 

Methods and pharmaceutical carriers for 
preparation of pharmaceutical compositions are well known 
in the art, for example as set out in textbooks such as 
Remington's Pharmaceutical Sciences, 17th Edition, Mack 

2 0 Publishing Company, Easton, Pennsylvania, USA. 

In a fifth aspect, the invention provides- ^ 
method of treatment of a condition characterised by 
abnormal bone resorption, comprising the step of 
administering an effective amount of a modulator of 

2 5 expression or function of the polypeptide of the invention. 

Where the condition involves excessive bone 
resorption, the method will comprise administration of the 
polypeptide of the invention or the nucleic acid encoding 
this polypeptide, or a biologically-active fragment or 

30 analogue thereof. Such conditions include, but are not 
limited to, osteoporosis, primary hyperparathyrodism, 
Paget 's disease, rheumatoid arthritis, renal 
osteodystrophy, humoral hypercalcaemia of malignancy, and 
conditions where cancer has metastasised to bone. 

35 Conditions characterised by deficient bone 

resorption include osteopetrosis. Antibodies directed 
against the polypeptide of the invention or anti-sense 
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oligonucleotides directed against the nucleic acid of the 
invention may be used to inhibit the function of the 
polypeptide and thus to increase bone resorption. 

It is also contemplated that the polypeptide of 
the invention may be used to promote healing of bone 
fractures, particularly in individuals where fracture 
healing is delayed or deficient. These include individuals 
suffering from osteoporosis or diabetes mellitus. 

In a sixth aspect, the invention provides a 
diagnostic kit, comprising a reagent selected from the 
group consisting of a nucleic acid of the invention or a 
fragment thereof capable of hybridising to a nucleic acid 
of the invention; an anti-sense nucleic acid of the 
invention; a polypeptide of the invention, and an antibody 
15 of the invention. For example, diagnostic kits for use in 
methods such as polymerase chain reaction, fluorescent in 
situ hybridisation, immunoassay, and the like are 
contemplated. Where appropriate, the molecule of the 
invention which is used may be labelled with a detectable 
marker, such as a radioactive, fluorescent, 
chemoluminescent or enzymic marker. Such diagnostic kits 
are useful for detection of abnormalities in the structure 
expression or control of the factor of the invention, which 
may lead to increased bone resorption and concomitant 
25 pathological manifestations. 

For the purposes of this specification it will be 
clearly understood that the word "comprising" means 
"including but not limited to", and that the word 
"comprises" has a corresponding meaning. 



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 summarises the factors and mechanisms 
mvolved in control of osteoclast differentiation and 
development, as understood before the date of the present 
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Figure 2 shows detection by Northern blotting of 
a 780 base pair mRNA species using rOCIL323 as a probe in a 
variety of rat clonal osteoblast-like cell lines. 

Figure 3 shows the results of Northern blot 
analysis using rOCIL402 / a 402 base pair fragment obtained 
by screening of a rat ROS 17/2.8 cDNA library, using the 
polymerase chain reaction. Similar results were obtained 
using rOCIL323 fragment as a probe. 

Figure 4 shows the comparison of rOCIL323 and 
rOCIL402 probes in Northern blotting of mRNA from 1,25- 
dihydroxy vitamin D 3 -treated rat UMR 10 6-06 cells. The 
results showed that both fragments detected the same 
species of mRNA. 

Figure 5 shows the results of Northern blot 
analysis of UMR 201 mRNA using mOCIL 2kb, showing that this 
probe detected the same 780 bp species as rOCIL323 and 
rOCIL402. 

Figure 6 shows the intron-exon structure of the 
mouse OCIL gene 

Figure 7 shows sequence comparisons between 
human, rat and mouse OCIL, as generated using the prpgram 
Clustal W. Regions of sequence identity are shown shaded 
and in bold. 

Figure 8 shows comparisons between the deduced 
protein sequence corresponding to hOCIL and the deduced 
protein sequences corresponding to rOCIL 1.3 and mOCIL, as 
generated using the program Clustal W. 

Figure 9 shows the results of treatment of 
cocultures of primary mouse calvarial osteoblasts and mouse 
bone marrow cells with an anti-sense oligonucleotide, 323 
A/S, directed against the C-type lectin region of OCIL and 
a second antisense oligonucleotide, 474 A/S, which was 
directed against the sequence in the open reading frame but 
outside the C-type lectin region. 

a: cocultures treated with anti-sense 
oligonucleotide under basal conditions. 
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b-d: cocultures stimulated with anti-sense 
oligonucleotides in the presence of 1,25- 
dihydroxyvitamin D 3 and PGE 2 . 

Figure 10 shows the results of Northern blot 
analysis of mRNA from UMR106 parental cells, demonstrating 
upregulation of expression of OCIL by IL-Icc, IL-10 i L -ll 
IL-17, TNFa, TGFp, M-CSF, GM-CSF, PGE 2 , 1 , 2 5 -dihydroxy- ' 
vitamin D 3 , 1 , 25-dihydroxyvitamin D 3 plus PGE 2 , and PGE 2 
plus dexamethasone . 

Figure 11A shows, the results of a time-course 
study, showing upregulation of OCIL by PTHrP . 

Figure 11B shows that the upregulation could be 
detected using either rOCIL402 or mOCIL 2kb as the probe. 

Figure 12 shows upregulation of expression of 
OCIL in primary mouse calvarial osteoblasts by IL-la, 
IL-ip, il-11, dexamethasone, and 1 , 25-dihydroxyvitamin D 3 . 

Figure 13A shows upregulation of expression of 
OCIL m ST2 mouse stromal cells by PGE 2 , dexamethasone 
1, 25-dihydroxyvitamin D 3 , IL-11, PT H, and 1 , 25-dihydroxy- 
vitamin D 3 plus PGE 2 . 

Figure 13B shows the time course of upregulation 
of OCIL expression in ST2 cells by dexamethasone. 

Figure 14 shows the results of Northern blot 
analysis of adult mouse tissues (left panel) and adult rat 
tissues (right panel), demonstrating expression of OCIL. 

Figure 15 shows the effect of recombinant rOCIL 
protein on formation of multinucleate osteoclasts from 
mouse calvarial osteoblasts. 
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DET AILED DESCRIPTION Q F THE I NVENT TDM 

The invention will now be described in detail by 
way of example only, with reference to the following non- 
limiting examples and drawings. 

We set out to clone a gene encoding a peptide 
that would function to prevent osteoclast formation. it is 
known that mature osteoblasts have limited potential to 
support osteoclast formation, and we postulated that mature 
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osteoblasts might express osteoclastogenic inhibitors. The 
pre-osteoblastic cell line UMR201 can be differentiated to 
a more mature osteoblastic phenotype by treatment with 
10" 6 M retinoic acid for 24 hr (Ng et al, 1988) . mRNA 
species differentially expressed between mature osteoblasts 
(retinoic acid-treated UMR201 cells) and immature 
osteoblasts (untreated UMR201 cells) were identified using 
an array of oligonucleotide primers in reverse 
transcription PCR, where products amplified from RNA from 
the two cellular populations were compared. We 
characterised products which were elevated in mature 
osteoblasts as candidates for osteoclastogenic inhibitory 
molecules . 

Abbreviations used herein are as follows: 
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GM-CSF 

hPTH 
IGF 
IL 
LIF 

M-CSF * 

1,25 (OH) 2 D 3 

OCIF 

ODF 

OPG 

PCR 

PGE2 

PTH 

PTHrP 

RANK 

TGF 

TNF 

TRAP 

Example 1 



granulocyte/macrophage colony stimulating 
factor 

human parathyroid hormone 
insulin like growth factor 
interleukin 

leukaemia inhibitory factor 

macrophage colony stimulating factor (CSF-1) 
1, 25-dihydroxy vitamin D 3 
osteoclastogenesis inhibitory factor 
osteoclast differentiation factor 
osteoprotegerin 
polymerase chain reaction 
prostaglandin E 2 
parathyroid hormone 

parathyroid hormone-related protein 
Receptor activator of NF-KB 
transforming growth factor 
tumour necrosis factor 
tartrate-resistant acid phosphatase. 

Isolation of Rat cDNA Encoding the Inhibitory 

Factor 
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Total RNA was isolated from retinoic acid-treated 
preosteoblastic UMR201 cells using guanidine thiocyanate 
(Chomczynski et a2, 1987) . First strand cDNA was 
synthesised from 2 U g of total RNA by incubating for 1 h at 
5 42o C with 15 units of AMV reverse transcriptase (Promega, 
Madison, WI) following oligo dT priming, a sense primer 
that was complementary to rat calcitonin cDNA, designated 
primer CT1 (SEQ ID NO : 1 ) , 

10 CT1 S'-ATG CTG GGC ACG TAC ACA CAA-3 ' 

and 3'UAP 5--CUA CUA CUA CUA GGC CAC GCG TCG ACT AGT AC-3 • 
(Clontech) were used as primers in polymerase chain 
reaction (PGR). The PGR conditions utilised a touchup PGR 
protocol with denaturation at 94° C for 5 min, and then 5 
cycles at 94°C for 1 min, 37 o c for 1 min and 7 2 o C for 1 
man, followed by 35 cycles of 9 4 o C 1 min , 49°C for 1 min 
and 72 o C for 1 min. For these experiments, Expand High 
Fidelity PGR system (Boehringer Mannheim) was used in a 
Perkm Elmer Cetus 480 thermal cycler. A 321 bp PGR 
product was obtained. This 321 bp fragment, which-we 
designated rOCIL323 (SEQ id NO: 2), was used as a probe in 
Northern blots. As shown in Figure 2, it hybridised to a 
780 bp mRNA species in UMR 201, UMR 201-10B, UMR 10 6-06 
UMR 106-01 and ROS 17/2.8 cells, all of which are rat ' 
clonal osteoblast-like cell lines. 

Since retinoic acid and PTH enhance OCIL mRNA 
expression dramatically in UMR 106-06 and UMR 106 parental 
cells, a similar RT-PCR procedure was carried out using RNA 
isolated from retinoic acid or hPTH 1-34 treated UMR 106 
parental cells. a PGR product identical to the 321 bp 
fragment for rOCIL323 was obtained and its expression was 
found to be upregulated in UMR 106 cells treated with 
either retinoic acid or hPTH 1-34. 

To extend the sequence of OCIL, anchored PGR was 
used to screen a rat ROS 17/2.8 cDNA library with Xgtll 
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arms. An antisense 21 bp primer, designated OCILrl (SEQ ID 
NO : 3 ) , 

OCILrl 5'-TGA GTG TTG TCT GTC CAC TTC CAA G-3 ' 

complementary to a sequence in the 321 bp fragment, was 
used with either the Xgtll forward primer 

5--GGT GGC GAC GAC TCC TGG AGC C-3 • or Xgtll reverse primer 
5' -GAC ACC AGA CCA ACT GGT AAT G-3' (Clontech) to amplify 
an aliquot no 6 plaque forming units) of the recombinant 
library. Cycling parameters were 94°C for 5 min, then 80 
cycles of 94°C for 30 sec, 60°C for 30 sec, and 72°C for 
2 min, followed by a final extension step of 72°C for 10 
min. A 402 bp fragment was obtained with Xgtll reverse 
15 primer as the anchored primer. Sequencing of this 402 bp 
fragment showed 88.6% identity over a length of 97 bp with 
rOCIL323 (SEQ ID NO: 2). The 402 bp fragment, designated 
rOCIL402, whose sequence is set out in SEQ ID NO:. 4, was 
used to probe Northern blots obtained from the rat 
20 osteoblast-like cell line. It hybridised to the same 
780 bp mRNA species observed with the rOCIL323 probe^ 
These results are shown in Figures 3 and 4. The same 
results were obtained in both the presence and absence of 
1 , 2 5-dihydroxyvitamin D 3 . 
25 A3' Rapid Amplification of cDNA Ends (3' -RACE) 

strategy was used to obtain the 3' ends of the cDNA of 
interest. The sense specific primers used were OCILrll 
(SEQ ID NO: 5) 

30 OCILrll 5'-GAA ACA TCC CCC TGG AGT ATC C-3' 
and OCILrl2 (SEQ ID NO: 6) 

OCILrl2 5 '-CCA AGT AAC TGG ACA TTG AGC CAG A-3 ' 
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complementary to sequences within rOCIL402 (SEQ ID NO : 4) 
First-strand cDNA was synthesised from total RNA isolated 
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from hPTH 1-34 treated UMR 106 parental cells, using the 
ol igo dT-anchor primer. The cDNA was further amplified by 
PGR using OCILrll or OCILrl2 and the 3' adaptor primer AP-1 
(Clontech) . PCRs were run at 9 4 o C for 5 min, then 35 cycles 
of 94°C for 30 s, 62°C for 30s, and 7 2 o C for 2 min, 
followed by a final extension step of 72 °C for 10 min 
Three different polyadenylated 3' sequences were obtained 
designated rOCIL1.3kb (SEQ ID NO: 7), rOCIL738bp (SEQ ID ' 
NO: 8) and rOCIL620bp (SEQ ID NO: 9) respectively. The 
region of sequence identity between rOCIL323 and rOCIL402 
was found to extend to 117 bp. 



Example 2 Isolation of Mouse cDN A and qDNA Encoding th P 

Inhibitory Factor 
rOCIL402 was labelled with [ 32 P] a-dCTP by using 
the Random Primer labelling kit (Boehringer Mannheim), and 
a mouse liver cDNA library was subjected to hybridisation 
screening at 65°C in a hybridisation buffer containing 
4 x SSPE (SSPE contains 0.15 M NaCl, 0.01 M NaH 2 P0 4 , and 
0.001 M EDTA) , 5 x Denhardt ' s solution, 0.5% sodium dodecyl 
sulfate (SDS) for 24 nr. The filters were then washed 
sequentially in 2 x SSC at 65 o C for 15 min, 2 x SSC with 
0.1% SDS at 65°C for 30 min, and finally 0.1 x SSC at 65° C 
for lOmin. We obtained a 1907bp mouse cDNA, designated 
25 mOCIL2kb (SEQ ID NO: 10). The sequence of mOCIL2kb shows 
80% identity over a length of 461 bp to that of rOCILl 3kb 
When used as an antisense riboprobe in Northern blot 
analysis, mOCIL2kb hybridised to a 780 bp mRNA species in 
UMR201 as detected by rOCIL323 and rOCIL402,as shown in 
3 0 Figure 5. 

A cDNA fragment corresponding to the nucleotides 
58-776 of mOCIL2kb was used as a cDNA probe to screen a 
genomic BAG Mouse I Hybridisation library. The screening a 
was performed under contract by Genome Systems, Inc 
According to their protocol, the cDNA fragment was labelled 
with [ P ] a-dCTP by random primer labelling and the 
lxbrary was screened under the hybridisation condition of 
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55°C in a hybridisation buffer containing 5.5 x SSC, 5 x 
Denhardt's solution, 0.5% SDS and 0.5 x HEPES buffer for 18 
hr. The filters were then washed sequentially in ImM Tris- 
HC1 pH 8.0 and 1% sarkosyl for 15 min, and 3 times in ImM 
Tris-HCl pH 8.0 for 15 min washes at room temperature. 
Eight positive clones were isolated, of which four 
positive clones were screened. After the genomic DNA was 
digested with Hindlll or BamHI, Southern blot analyses were 
carried on with the same cDNA probe. The results showed 
that two clones were identical, and one clone could not 
grow in culture. One of the duplicate clones, designated 
db. 20149, was sequenced by subcloning into the pBS vector 
and by direct sequencing of the genomic clone using cycle 
sequencing. Sequencing of 9349 bp of the genomic clone of 
mOCIL was completed (SEQ ID NO: 11) . It contains 6 exons, 
as shown in Figure 6. The 5' flanking region adjacent to 
exon I is a GC-rich region, containing a Sp 1 binding site. 
A 1 kb gap between nucleotides 1862 and 1868 is currently 
being sequenced. The genomic sequence confirmed that 
nucleotides 1-32 0 of the original mOCIL2kb represented an 
inverted repeat of the 3* end of the sequence, and that 
nucleotides 90 0-1907 of mOCIL2kb were not part of the mOCIL 
sequence. The corrected full length mOCIL cDNA is 9 90 bp 
in length (SEQ ID NO: 12) . 

To confirm the mOCIL cDNA sequence, RT-PCR was 
carried out using total RNA isolated from ST2 mouse stromal 
cells, primary mouse calvarial osteoblasts and mouse liver 
tissue. The sense primer represented nucleotides 18-36 of 
mOCIL and is designated as primer OCILm47 (SEQ ID NO: 13), 

OCILm47 5'- TCC CAT GCC AGA TTG CTT G-3 ' 

The antisense primer represented nucleotides 743- 
72 0 of mOCIL and is designated primer OCILml2 (SEQ ID 
NO : 14), 

OCILml2 5'-GGG ACC ATA GGG GAA AAA GTA G-3' 
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^ at 94 ° C f ° r 5 min ' then 35 cycles 

of 94 C for 30s, 6 0 o C for 30s, and 7 2 o C for 1 min, followed 
by a fxnal extension step of 72°C for 10 min. Five clones 
containing a 721 bp fragment were obtained from three 
sources, ST2 mouse stromal cells, primary mouse calvarial 
osteoblasts and mouse liver tissue. When compared to the 
mOCIL sequence there was 100% identity in the first 106 bp 

exons I and II) but only 90.5% identity in the remaining 
615 bp. This 721 bp fragment was designated as mOCIL47 
(SEQ ID NO: 15) . 

To confirm whether nucleotides 32 0-434 of 
mOClL2kb are part of mOCIL, a sense primer representing 
nucleotides 343-364 of mOCIL2kb, designated as primer 
OCILml7 (SEQ ID NO: 16), 

OCILml7 S'-TGG AAA CTC AGC TCC TCA OCT CTG-3 ' 

and antisense primer OCILml2 was used to carry out RT-PCR 
wxth RNA from three sources, ST2 mouse stromal cells 
prxmary mouse calvarial osteoblasts and mouse liver tissue 
as above. PGR was run under the same conditions as for 
mOCIL47 Twelve clones were obtained, each containing a 
713 bp fragment. The first 115 bp were not identical to 

/ n 2 ° f the 12 Clones < ^ere was 100% identity to 
mOCIL after the first 115 bp. m the other 10 clones, 
there was 92.2% identity to mOCIL and 89.2% identity to 
mOCIL47 after the first 115 bp. This sequence is 
designated mOCIL17 (SEQ ID NO: 17). 

RT-PCR was also carried out using a sense primer 
corresponding to the region located at the junction of exon 
III and exon IV, representing nucleotides 243-2 67 of mOCIL 
and designated primer OCILm32 (SEQ ID NO: 18), 

35 OCILm32 5 ' - TTT GTC AGC AAC AAA GAC AGA ACA G-3 ' 
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Primer OCILml2 was used as a antisense primer. 
RT-PCR was carried out with RNA from three sources, ST2 
mouse stromal cells, primary mouse calvarial osteoblasts 
and mouse liver tissue, as above. PCR was run under the 
5 same conditions. Four clones were obtained, each 

containing a 502 bp fragment. Three of the four clones 
have 100% identity to mOCIL17 and one of the four clones is 
100% identical to mOCIL. 

mOCIL has an open reading frame encoding a 
10 218 amino acid protein. its structure is typical of a type 
II membrane protein, with a predicted 142 amino acid 
extracellular domain, a 21 amino acid transmembrane domain, 
and a 55 amino acid cytoplasmic domain. The extracellular 
domain has 6 cysteine residues. There are three potential 
15 N-linked glycosylat ion sites at residues 86, 112 and 207, 
all of which are in the extracellular domain. There is a 
myristylation motif in the intracellular domain. 

Comparison of the putative protein sequences 
derived from the rOCIL323, rOCIL1.3kb and mOCIL cDNA 
sequences with the SwissProt protein database indicated 
that the mOCIL protein sequence included an 80 amino acid 
C-lectin type motif, from positions 92 to 171 in the mOCIL 
protein sequence. This C-lectin motif is similar to that 
of CD69, a membrane -bound lectin expressed by bone marrow 
2 5 haematopoietic cells, and thought to be involved in 
monocyte differentiation. C-lectin motifs are also 
involved in cell-cell contact and lipid binding (Sharon and 
Lis, 1995; Gabius 1997; Kieda, 1998). 

MOCIL47 shows 100% identity to a cDNA clone encoding a C- 
type lectin expressed in bone marrow cells, which was 
deposited in the Gen Bank data base on 29 March 1999 
(Accession No. AF121352). The full length mOCIL cDNA (SEQ 
ID NO: 12) is 92% homologous to this. 

The three different sequences (SEQ ID NO: 12, 15 
35 and 17), which overall have 90% identity, may represent 

gene duplications, where either one or all three sequences 
may have similar biological outcomes. The functional data 
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10 



we have to date, relating to the inhibition of osteoclast 
formation from haemopoietic precursor cells using antisense 
oligonucleotides (SEQ id NO: 1 and 13), have been obtained 
mainly with mOCIL17 ( SEQ ID NO: 7) . 

Example 3 Isolation of Human cDNA Encoding t-h*> 

Inhibitory Factor 
t 32 P] CC-dCTP labelled rOCIL402 was used to probe 
a human fetal cDNA library under low stringency 
hybridisation conditions at 5 5 o C in a hybridisation buffer 
containing 4 x SSPE, 5 x Denhardt < s solution, 0.5% SDS for 
24 hr. and then washed with low stringency at 2 x SSC with 
0.1% SDS at 40o C for 15 min, and 1 x SSC/0.1% SDS at 40°C 
for 15mm. Eight positive clones were obtained after 
15 tertxary screening. Clone No. 6 is a 1.3 kb cDNA segment 
whose sequence was designated hOCIL (SEQ ID NO: 19) The 
putative protein sequence encoded by bp883-1059 was a C- 
type lectin moiety, which showed 73% homology to the C-type 
lectin sequence previously demonstrated in rOCIL323 
rOCIL1.3kb and mOCIL2kb. However, regions of amino 'acid 
sequence 5- and 3' to this C-type lectin domain were 
different from those of the mouse and rat sequences /as 
shown in Figure 6 . 

Clone No. 8 is 960 bp long. it has a 64% 
identity over a length of 145 bp with rOCIL402 . A search 
of the EST database showed that clone No. 8 has 99 5% 
sequence identity with a human EST clone. Accession No 
AA029932, over a published length of 209 bp. This EST 
clone was ordered and further sequenced. The EST clone is 
680 bp m length, and has 64% identity with rOCIL1.3kb over 
a length of 343 bp. it also has 64% identity over a length 
of 346 bp compared to mOCIL. rt-PCR showed that clone NO 
8 and AA029932 represent overlapping clones, which are 
contiguous, and combine to represent a human OCIL (hOCIL) 
of 1336 bp in length ( SEQ ID NO: 20) . 

The deduced protein sequence showed 56% homology 
to the deduced protein sequence of rOCILl . 3kb, and 62% 

H.xPcl.r^Mteepxspeci.NST Vincents osteoclast .doc 14/07/99 
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homology to that of mOCIL, as shown in Figure 7. These 
differences are principally at the N-terminal. Although 
there is 80% homology between the mouse and human OCIL 
proteins in certain regions, this indicates that the mouse 
cDNA could not reliably be used to isolate a human genomic 
DNA encoding hOCIL. 

In order to obtain the hOCIL gene, the 680 bp 
cDNA insert of clone AA029932 was isolated and screened by 
Genome Systems, Inc. against the genomic BAG Human Release 

11 Hybridisation library, as described in Example 2. One 
positive clone was obtained and 7kb of this clone was 
sequenced. This sequence has 100% identity to a human 
genomic clone, which was deposited in the GenBank database 
on April 6, 1999 (Accession No. AC007068) . These 
sequences, when combined, indicate that the hOCIL gene is 
greater than 10 kb in length (SEQ ID NO: 21) . The genomic 
sequences corresponding to the 5' flanking region, promoter 
region and the first 654 bp of cDNA sequences are not 
represented in the sequence deposited in Accession No. 
AC007068 . 

The hOCIL gene is located in chromosome 12,12p. Chromosome 

12 and chromosome 11 are .considered to be evolutionarily 
related. There are several examples of evolutionarily 
related proteins whose genes are located on chromosome 12 
and chromosome 11 such as PTH and PTHrP, IGF and IGF I , 
Harvey ras sarcoma 1 and Kisten ras sarcoma 2 etc. (Martin 
et al . , 1991). Thus chromosome 11 and chromosome 12 share 
genes of similar biological characteristics with redundant 
function . 

Example 4 Effect of Anti-Sense Oligonucleotides on 

Transcription 
Primary mouse calvarial osteoblasts were 
cocultured with mouse bone marrow cells to generate 
mononuclear and multinucleate osteoclasts. Staining for 
tartrate-resistant acid phosphatase (TRAP) , performed using 
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a commercial leukocyte acid phosphatase kit from Sigma 
Diagnostics (St. Louis, MO, USA) (Katsogiannis et a J , 
1998), identified these cells as osteoclasts. Under normal 
conditions, multinucleate functional osteoclasts are not 
formed unless the cocultures are stimulated with 1,2 5- 
dihydroxyvitamin D3 and PGE2 - 

Experiments were carried out to block translation 
of OCIL mRNA in order to determine the function of its 
translated product. Primary mouse calvarial osteoblasts 
were treated with antisense oligonucleotides. Four 
antisense oligonucleotide sequences were designed. Two of 
these antisense oligonucleotide sequences were 
complementary to the C-type lectin region, and designated 
323 (SEQ ID NO: 22) and 402 (SEQ ID NO: 23) respectively: 

323 5 '-GAG TGT TGT CTG TCC ACT TCC-3 ' 

402 5'-TTT CCA ACT CCA ATC CAG TTT-3 ' 

20 The 323 antisense oligonucleotide has 19 of 21 bp 
complementary to mOCIL47 . 



S.-^TG^^CACTTCC-S- 3 2 3 a „ cis ense 

3 ' -GTCACAACAAACAGGTGAAGG-5 ' mOCIL47 Strand + 



30 



35 



and has 20 of 21 bp complementary to mOCIL and mOCIL17 : 

5'-GAGTGTTGTCTGTCCACTTCC-3' 323 antisense 
I I M I I I I 1 I I I I l 1 1 1 1 1 1 

3 ' -GTCACAACAGACAGGTGAAGG-5 ' mOCIL17 Strand + 

The 402 antisense oligonucleotide has 100% complementarity 
to mOCIL and has 2 0 of 21 bp complementary to mOCIL47 and 
mOCIL17 : 
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5 ' -TTTCCAACTCCAATCCAGTTT-3 ' 402 antisense 

M I I I I I I I I I I I I I I I I I I 
3 ' -AAAGGTTGAGGTCAGGTCAAA-5 ' mOCIL47 Strand + 

5 '-TTTCCAACTCCAATCCAGTTT-3' 402 antisense 
Ml I I I I I I I I I I | | | | | | | 

10 3 ' -AAAAGTTGAGGTTAGGTCAAA-5 ' mOCIL17 Strand + 

The other two antisense oligonucleotide 
sequences, respectively designated 43 9 ( SEQ ID NO: 24) and 
15 474 (SEQ ID NO: 25), specifically inhibit the translation 
of mOCIL17. 

The oligonucleotide 43 9 is antisense to the sense 
primer OCILml7 and located upstream of the open reading 
20 frame: 

43 9 5* GAG GAG CTG AGT TTC CAC TAC-3 ' 



25 



30 



40 



The antisense oligonucleotide 474 is 
complementary to a region in mOCIL17 in the open reading 
frame but outside the C-type lectin region: 

474 5'-GGT AGG GAA GCC TTT GTG AC-3 ' . 



Under basal conditions, ie. in the absence of 
stimulation with 1 . 2 5-dihydroxyvitamin D 3 and PGE 2 , there 
was a 3- to 5-fold increase in the number of mononucleate 
TRAP-positive cells in the cocultures treated with the 323 
and 474 antisense oligonucleotides over the period from 3 
35 to 7 days. With the 402 antisense oligonucleotide, a 

4.5-fold increase in the formation of mononucleate TRAP 
positive cells was observed after 7 days treatment. 
Multinucleate TRAP-positive cells (323; 4.5 ± 2, 474; 
4.25 ± 1.25) were also observed in cocultures treated with 
both 323 and 474 antisense oligonucleotides at a 
concentration of 5um, whereas none were observed in the 
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control. These experiments were performed three times, and 
a representative result is shown in Figure 9a. 

When the cocultures were stimulated with 
1,25-dihydroxyvitamin D 3 and PGE 2 , multinucleate TRAP- 
positive osteoclasts were formed after 7 days. Treatment 
with 5 UM 323 antisense oligonucleotide resulted in a 
seven-fold increase in the number of multinucleate 
osteoclasts, as shown in Figure 9b. 

Treatment with 10 |0M 402, 5 MM 439 and 474 
antisense oligonucleotide resulted in a 2 to 3 -fold 
increase in the formation of multinucleate osteoclasts 
after 7 days, as shown in Figures 9c and 9d. 

These TRAP-positive cells were further 
characterized as osteoclasts by the presence of receptors 
for calcitonin, demonstrated using autoradiography and 
immunostaining, and by the ability of these cells to form 
resorption pits' in bone slices. 

Effects of mOCIL antisense oligonucleotides on 
the three phases of osteoclast formation were also 
20 investigated. Mouse bone marrow and primary osteoblastic 
cells were cocultured in the absence of 
1,25-dihydroxyvitamin D 3 and PGE 2 for a 7 day culture' 
period. 323 and 474 antisense oligonucleotides were added 
for 3 phases of culture: the first phase (0-3 days) in 
which there is proliferation of osteoclast progenitors, the 
second phase (3-5 days) and the final phase (5-7 days) in 
which these cells differentiate into mature osteoclasts 
TRAP-positive osteoclasts were counted. In order to 
examine the role of OCIL on the bone resorptive activity of 
mature osteoclastic cells, the cells were also cultured on 
dentine slices under the same culture conditions as above 
and resorption pits formed on dentine slices were 
quantitated. The results are shown in Table 1, and 
indicate that the OCIL acted at an early stage in 
35 osteoclast formation. 
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EXample 5 Regulation o f_ expression of QCIL mRNA 

The regulation of OCIL mRNA expression was 
examined in the UMR106 parental osteoblast-like cell line 
using rOCIL402 as a probe. As shown in Figure 10 
expression of the mRNA was upregulated by retinoic acid 
(RAJ, parathyroid hormone (1-34), parathyroid hormone 
related protein (1-34), TNF-oc, interleukin la (il-1 a) 

Tl\ ; -^T' IL ~ 17 ' GM " CSF ' M ~ CSF ' TCFP ' ^xamethasone, 
1,25-dihydroxyvitamin D 3 and prostaglandin E 2 . A time 
course study, illustrated in Figure 11, showed that 
parathyroid hormone-related protein (1-34) increased levels 
of OCIL m*NA as early as 1 hour, peaking at 4 hours and 

maintaining the hiah lpvei ^ ~ 

gn xeve l of expression until 48 hours 

As shown in Figure 12, in primary mouse calvarial 
osteoblasts, OCIL mRNA was upregulated by IL-1 a IL-lp 
IL-11, 1,2 5-dihydroxyvitamin D 3 and retinoic acid. In ST2 
mouse stromal cells, OCIL mRNA was upregulated by 
dexamethasone, 1,25- dihydroxyvitamin D 3 and IL-11 The 
time course also showed that dexamethasone increased OCIL 
m*NA at 1 hour, peaking at 2 hours and returning a basal 

ZanTZ hOUrS ' ThSSe rSSUltS d in- Figures 

liA and 13B, respectively. 

Example 6 Localizati on of oc Tr , mBMa 

mRNA encoding OCIL was localised in fetal 
newborn and adult mouse cissue by ^ hybridlsa ; 
usxng the rOCIL 402 antisense probe, using a method 
described previously (Katsogiannis et al. 1997 and 1998) 
Plasmrd CDMA was labelled with digox.genin (DM) using an 
RNA labellrng krt (Boehringer-Mannheim, Mannheim GmbH 
Germany, . Hybridisation signals were detected by alkaline 
Phosphatase staining „ ith BCIP/NBT a£ter incubatiQn 

Thrm^r 19 " 1 " " tib0dy C ° UPled t0 alkaline Phosphatase. 
Z TabTe 2 """"^ " * ° f - summarized 



tr vi.r«_E*rrs Osteoclasc.doc 14/07/99 




- 24 - 



Table 2 

Adult Rat Tissues Probed with rOCIL402 





li — : " — s 

|| Tissue 


OCIL mRNA 


Calvaria 


Osteoblasts 


+ + + 




Marrow hematopoietic 


+ + 




cells 






Megakaryocytes 


+ + 


Kidney 


Medulla 


+ + + 




(collecting tubules) 






Outer cortex 


+ 




(collecting tubules only) 






Glomeruli (endothelial 


-ve 




cells only weakly 






positive) 






Proximal/distal tubules 


-ve 


Lung 


pneumocytes 


+ + 




bronchial epithelium 




Brain 


Neurones in cerebral 


+++ 




cortex, cerebellar 






cortex, hippocampus ; 






choroid plexus 




Heart- 


Cardiac muscle 


+++ 


Spleen 


White pulp 


+++ 




Cortex 


+ 




Red pulp 


-ve 


Gut 


Luminal epithelium 


++ 


Liver 


Hepatocytes 


-ve 



5 

In situ hybridization was also carried out to 
detect OCIL mRNA localisation in adult rat tissue and human 
skin, using the same method, and the results are summarized 
10 in Table 3 . 
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Table 3 

Normal Murine Tissues Expressing OCIL mRNA 



Fetal 


Newborn 




(day 15) 


(day 1) 


(5 




(+) 
(++) 

(+++) 
(-) 

(na) 



denotes weak signal 
denotes moderate signal 
denotes strong signal 
denotes absence of signal 
not applicable 



10 



15 



- 26 - 

(nd) not determined. 

* (mk) megakaryoblast of fetal liver. 



OCIL mRNA localization in human skin probed with OCIL402 : 

Epidermis (all layers) +++ 
**Basal layer slightly weaker signal 

In Northern blot analyses of adult mouse tissues 
using mOCIL2 as the probe, OCIL mRNA was shown to be 
expressed in heart, skin, lung, liver, kidney, gut and 
brain. In adult rat, OCIL mRNA was found to be expressed 
in brain, bone, lung, liver, gut, kidney, mouse, skin and 
heart. These results are illustrated in Figure 14. 



Example 7 Antibody Directed Against OCIL 

The following peptide fragment of the deduced 
amino acid sequence derived from the cDNA sequence of OCIL 
was synthesized, and was used to immunize rabbits, using 
2 0 standard protocols. 



H-Cys-Met-Arg-Thr-Glu-Ala-Gln-Leu-Ala-Arg-Phe-Asp-Asn-Gln- 
Asp-Glu-Leu-Asn-Phe-OH (SEQ ID NO: 26) 

25 The antibody raised may be used to detect the 

OCIL protein, using standard immunohistochemical methods, 
or to neutralize OCIL activity in murine co-cultures to 
stimulate osteoclast formation. 

3 0 Example 8 Immunohi s t ochemi stry 

A rabbit polyclonal antibody prepared as 
described in Example 7 was used for immunohi st ochemi s t ry . 
A kit for the standard peroxidase-labelled s treptavidin- 
biotin detection method (DAKO, Boenisch, 1989) was used 

35 according to the manufacturer's instructions with minor 
modifications. The dilution of the antiserum used was 
optimised in preliminary experiments. Incubation of tissue 
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sections with a 1:100 dilution of the primary antiserum was 
earned out overnight at 4°C in a humidified chamber 
Peroxidase activity was detected with 3 • -3 ' -diamino- 
benzidine tetrahydrochloride (Sigma) and 0.15% H 2 0 2 
Slides were counterstained with haematoxylin, dehydrated 
and mounted on a coverslip. The results are summarized in 
Table 4. 

Table 4 

Normal Murine Tissues Expressing mOCIL protein 




Extraskeletal tissues 
Brain 
Lung 
Heart 
Kidney 

(collecting tubules) 
Small Intestine 
Liver 

Skeletal muscle 

Skin 

Spleen 



nd 
nd 
nd 
nd 

nd 
nd 
nd 
nd 
nd 



++ 
++ 

nd 
nd 
++ 

nd 



+++ 
+ 

+ + 

nd 
•nd 
+ + 
++ 
nd 



Skeletal tissues/cells 
Long bone 
chondrocy t es 
osteoblasts 
osteoclasts 

perichondrium/periosteum 
mar row/megakaryocvt es 



(+) denotes weak signal; 

(++) denotes moderate signal; 

(+++) denotes strong signal; 



nd 
na 
nd 
nd 
na 



+++ 
nd 
+ + 



+ + + 
nd 
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(— ) denotes absence of signal; 
(na) not applicable; 
(nd) not determined. 



10 



Example 9 Recombinant OCIL protein 

OCIL proteins were prepared by recombinant DNA 
technology to allow more extensive laboratory studies of 
their actions on osteoclast formation as well as osteoblast 
function. Soluble mouse and rat OCIL cDNA tagged at the N- 
terminus with the FLAG epitope were constructed in the pEF- 
BOS Mammalian expression vector (Mizushima & Nagata 1990), 
which had been modified to contain an in-frame IL-3 signal 
sequence and FLAG peptide coding sequence (gift of Dr. D 
Hilton) . 

15 In order to obtain a RT-PCR product encoding 

the mOCIL extracellular domain (amino acids 77-218) to 
clone into the Mlul site of the vector, as shown in Figure 
6, the RT-PCR was carried out using total RNA isolated from 
primary mouse calvarial osteoblasts, which support 

20 osteoclast differentiation in coculture. A sense primer, 
OCILm33, comprising OCILm32 representing nucleotides, 243- 
267 of mOCIL and containing a Mlul site, designated primer 
OCILm33 (SEQ ID NO : 27) : 

25 OCILm33 5 ' -GCC ACG CGT TTG TCA GCA ACA AAG ACA GAA CAG-3 ' 

and an antisense primer comprising OCILml2 (SEQ ID NO: 4), 
representing nucleotides 743-720 of mOCIL and containing a 
Mlul site, designated primer OCILm46 (SEQ ID NO: 28), 



30 



35 



OCILm46 5' -GCC ACG CGT GGG ACC ATA GGG GAA AAA GTA G-3 ' 

were used as primers in the PCR . PCR was run at 94°C for 
5 min, then 35 cycles of 94°C for 30 s, 60°C for 30s, and 
72°C for 1 min, followed by a final extension step of 72°C 
for 10 min. A 501 bp fragment was obtained and further 
cloned into the expression vector pEF-BOS . The open 
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reading frame and FLAG fusion was confirmed by sequencing 
(bp 1-132), and the 501 bp fragment sequence (SEQ ID 
NO: 29) was confirmed to be identical to mOCILl? . 

To obtain a PCR product encoding the rOCILl 3kb 
extracellular domain (amino acids 40-179), a sense primer 
to represent nucleotides 126-146 of rOCIL1.3kb with the 
Mlul site, designated primer OCILr22 (SEQ ID NO: 30), 

OCILr22 5'- GCC ACG CGT TCA GTA AAA AAG ACA GCC AAG-3 ' 

and an antisense primer representing nucleotides 544-526 of 
rOCIL1.3kb with the Mlul site, and designated primer 
OCILr23 (SEQ ID NO: 31), 

15 OCILr23 5 ' -GCC CAG CGT AAC TAC AGG CAC TGT GAG G-3 • 

were used as primers in a PCR. PCR was carried using 
rOCIL1.3 kb plasmid as template and run at 94°C for 5 min 
then 35 cycles of 94°C for 30 s, 60°C for 30s, and 72°C for 
1 nun, followed by a final extension step of 72°C for 
10 min. A 421 bp fragment was obtained and cloned-Lnto the 
expression vector pEF-BOS . The open reading frame and FLAG 
fusion were confirmed by sequencing. 

HEK 293 cells were transfected with both mouse 
and rat expression constructs using Lipof ectamine (Life 
Technologies, Inc) . Supernatant was harvested after 
72 hours. The recombinant protein was purified by 
mcubation with the anti-FLAG M2 affinity gel (Kodak), and 
eluted with the FLAG peptide (Kodak) as outlined in the 
manufacturer's protocol. The purified protein was used to 
study lt s effects on osteoclast formation in murine 
cocultures . 

An experiment was also carried out to determine 
the action of rOCIL protein in osteoclast formation 
Primary mouse calvarial osteoblasts were cocultured with 
mouse spleen cells obtained from 15 week-old mice and 
stimulated with 1 , 25-dihydroxyvitamin D 3 and PGE 2 in the 



35 
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presence of rOCIL protein for 10 days. A negative control 
was carried out as with protein carrier buffer alone. As 
shown in Figure 15, rOCIL protein significantly reduced the 
number of osteoclasts formed when compared to the presence 
5 of carrier buffer alone. 

DISCUSSION 

We conclude from our experiments that in 
osteoblasts OCIL is expressed on the cell surface as a type 

10 II membrane peptide. Contact with haematopoietic precursor 
cells prevents further differentiation into mononucleate 
osteoclast precursors, and ultimately into functional 
multinucleate osteoclasts. Without wishing to limit the 
scope of the invention by any proposed mechanism, we 

15 consider that upregulation of OCIL mRNA expression by the 

same osteotropic factors that increase expression of ODF is 
consistent with the hypothesis that regulation of bone 
resorption by osteoclasts is tightly regulated. According 
to this hypothesis, stimulation of multinucleate osteoclast 

2 0 formation through ODF would simultaneously prevent the 

generation of new osteoclasts through the action of 1 OCIL. 
If this system is operative under normal physiological 
conditions, then bone resorption becomes a self -limiting 
process . 

25 In vivo, OCIL has the potential to be used as a 

therapeutic agent in the treatment of conditions which are 
characterised by excessive bone resorption, such as 
osteoporosis, primary hyperparathyroidism, Paget ' s disease, 
rheumatoid arthritis, renal osteodystrophy, and humoral 

30 hypercalcaemia of malignancy, as well as metastatic bone 
disease. Modulation of the expression or function of the 
factor may also be useful in the treatment of disorders 
involving extra -skeletal calcification . 

35 It will be apparent to the person skilled in the 

art that while the invention has been described in some 
detail for the purposes of clarity and understanding, 
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various modifications and alterations to. the embodiments 
and methods described herein may be made without departing 
from the scope of the inventive concept disclosed in this 
specification . 



References cited herein are listed on the 
following pages, and are incorporated herein by this 
reference . 
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SEQUENCE ID LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 

5 (A) NAME: ST VINCENT'S INSTITUTE OF MEDICAL 

RESEARCH 

(B) STREET: 41 VICTORIA PARADE 

(C) CITY: FITZROY 

(D) STATE: VICTORIA 

10 (E) COUNTRY: AUSTRALIA 

(F) POSTAL CODE ( ZIP) : 3 065 

(ii) TITLE OF INVENTION: INHIBITOR OF OSTEOCLAST 
PRECURSOR FORMATION 

15 

(iii) NUMBER OF SEQUENCES: 31 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Versic 
#1.30 (EPO) 



25 (2) INFORMATION FOR SEQ ID NO: 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID 

15 

ATGCTGGGCA CGTACACACA A o n 
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( 2 ) INFORMATION FOR SEQ ID NO : 2 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

CGCTCTAGCC CGGCCACGCG TCGACTAGTA CAGCTCCAAA TCTGTGCCCC 
TCAGTTCCTC 60 

CCTCCTGTTA TCTC TAG AGG AAGCTGTGGA G AG ATTC C AG GATCATCTGA 
AACAGAGACA 12 0 

CATGCATTCT CGGC TTTTTG TGTTTTATTA C AG AATTTC T TAAGCAGATA 
CAAAGGGAGT 180 

TTTGATTACT GGATCGGCCT GCACAGAGAG TCCTCAGAGC ACCC TTGGAA 
GTGGACAGAC 240 

AACACTCAGT ATAACTACTC GTATGTTTCA CAATGTTTTT TCTTCTACTG 
TGTTCATGTC 300 

TTGTTGAGGT CTTGTGTGTA C 321 



( 2 ) INFORMATION FOR SEQ ID NO : 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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- 3 - 



(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
TGAGTGTTGT CTGTCCACTT CCAA 24 



(2) INFORMATION FOR SEQ ID NO : 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACAGTAAAAT GCTCCAAGGA AAGCTTCCCA GAAACATCCC CCTGGAGTAT 
CCTGCTGGGC 60 

20 

CTTACTGCTG CTACGTAGTG ATCATTGTCC TCAGTGTTAG CTGTAGTTCT 
CTTTCTGTTG 12 0 

CTTTGTCAGT AAAAAAGACA GCCAAGATCT CAACCATAAA TAC TTATGCT 
25 GCTTGCCCGA 180 



GAAACTGGAT TGGAGTTGGA AATAAATGTT TTTATTTTAA TGAAATACCA 
AGTAACTGGA 240 

3 0 CATTGAGCCA GACCCTCTGT AAGGAACAAG GGGCCGAGCT AGCACGATTT 
GACACCGAGG 3 00 

AGGAGCTGAA TTTCCTAAGG AGATACAAAG GGAGTTCAGG TTACTGGTCC 
GGTC TGC AC A 3 60 

35 



GAGAGTCATC AGCGCACCCT TGGAAGTGGA CAGACAACAC TC 



402 
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(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
GAAACATCCC CCTGGAGTAT CC 22 



15 (2) INFORMATION FOR SEQ ID NO : 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

25 

CCAAGTAACT GGACATTGAG CCAGA 25 



(2) INFORMATION FOR SEQ ID NO : 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACAGTAAAAT GCTCCAAGGA AAGC TTCCC A GAAACATCCC CCTGGAGTAT 
CCTGCTGGGC 60 

5 

CTTACTGCTG CTACGTAGTG ATCATTGTCC TCAGTGTTAG C TGTAGTTC T 
CTTTCTGTTG 12 0 

CTTTGTCAGT AAAAAAGACA GCCAAGATCT CAACCATAAA TACTTATGCT 
10 GCTTGCCCGA 180 

GAAACTGGAT TGGAGTTGGA AATAAATGTT TTTATTTTAA TGAAATACCA 
AGTAACTGGA 240 

15 CATTGAGCCA GACCCTCTGT AAGGAACAAG GGGCCGAGCT AGCACGATTT 
GACACCGAGG 3 00 

AGGAGCTGAA TTTCCTAAGG AGATACAAAG GGAGTTCAGG TTACTGGTTC 
GGTCTGCACA 3 60 

20 

GAGAGTCATC AGCGCACCCT TGGAAGTGGA CAGACAACAC TGAGTATAAC 
AACTCGGTTT 42 0 

CCATCGGAGG AGATGAAAAA CATGGCTTCC TGAGTGACAA TGGGTTCAGC 
25 AGTGGCAGGG 480 

GTTATATAGT GAGGAAGTCG ATTTGTAGGA AGCCCAACAG CTACACCTCA 
CAGTGCCTGT 540 

3 0 AGTTTTGTGT CCTTGGTTGA GACTTTGTCC TAACAGTCAT GAGGAACACA 
GAACATGGTA 600 

TCTACAGTGC CTGAATCATG AACAATCTGC TAAAATCATC TTCAATTCAT 
AATGTGTGGT 660 

35 

GACATCTAAG ATAACAACTG AGGCATATTT TGC TTGGG AG ATCATGAATT 
GTTCTATATT 72 0 
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AAATAGGTAT TCAGGTATGA GCTGGTTCTC ACATCTTAAA CATAAACTGA 
ATCATGTCAG 78 0 

5 TATTAGTTAT CTCTACTTTC TTTTTTCTCT CATTTAAATT ATATTATTTA 
TTTATATTCC 840 

AAATACCGTC CCCTCCTTGT TCCCCCTTCT AGAGTTGTTC ACTCCATACC 
CCTTCATCTT 900 

10 

TACTTCTGAA GAGATGTTCC CCCACCCCAC TCTGAGTATT TCCCTTCTCT 
TGGACTTTAG 9 60 

GAC TGTAC AG GATTAGGTGC ATCCTCTCAT AGTGAGGCCA ACTGTAGGGA 
15 GCTGCGACAT 102 0 

GCCGTGCCTC AAAATGGTGC TGGTTTCCGC CTTCCACCCT CCCAACAGTG 
AGCGCTCCTT 1080 

2 0 GTAGTAAACA AGTCC TTATT TGACTATGCC TGCCTGGCCT GCTAGGTTCA 
GCATAGTGAC 1140 

-' / 

AGCCTGTCTG CATGACCCAT GTGGCACGTT GGGGTTGGTT GGTGTTGGAT 
ACATAAGCTG 12 00 

25 

ATGTAGGGCA TTCCCCTGGG GTAGTAGATG ATTGTATCAA GGTTCCTGAA 
TAAACTGCTT 12 60 

GAAGAAAAAA AAAAAAAAAA AAGTAC TAGT CGACGCGTGG CC 13 02 

30 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

AGTAAAATGC TCCAAGGAAA GCTTCCCAGA AACATCCCCC TGGAGTATCC 
TGCTGGGCCT 60 

0 TACTGCTGCT ACGTAGTGAT CATTGTCCTC AGTGTTAGCT GTAGTTCTCT 
TTCTGTTGCT 12 0 

TTGTCAGTAA AAAAGACAGC CAAGATCTCA ACCATAAATA CTTATGCTGC 
TTGCCCGAGA 180 

5 

AACTGGATTG GAGTTGGAAA TAAATGTTTT TATTTTAATG AAATACCAAG 
TAACTGGACA 240 

TTGAGCCAGA CCCTCTGTAA GGAACAAGGG GCCGAGCTAG CACGATTTGA 
0 CACCGAGGAG 300 

GAGCTGAATT TCCTAAGGAG ATACAAAGGG AGTTCAGGTT ACTGGTCCGG 
TCTGCACAGA 360 

GAGTCATCAG CGCACCCTTG GAAGTGGACA GACAACACTC AGTATAACTA 
CTCACAGAGC 42 0 

C TC AG ATGGG GAGCCGGGAC TCTGAAATCC CAGAAAGCCA CTGC AGAAC T 
GCAAGCCTGA 480 

GATTTTGATG TCCACTATTT GCATGGCTGC ACCTGTTCAG GAAAGCAGAG 
ATTTTAAGGA 540 

CATTCGGAAC CTCCTTTAAA GTTTTGTCAT CACAGAGCAC CCAAAACAGT 
CCTCGAATCA 600 
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CAGGCCCAGT CCCATCCACC GTTAAAGCAC CTTTGAGCAA TTTAATAAGA 
AGTGCGTGTT 660 

CCCATGTGTA AAATGAATAA AAACAGAATT GGAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA 720 

AAAAAAAAAA AAAAAAAA 738 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AGTAAAATGC TCCAAGGAAA GCTTCCCAGA AACATCCCCC TGGAGTATCC 
TGCTGGGCCT 60 

TACTGCTGCT ACGTAGTGAT CATTGTCCTC AGTGTTAGCT GTAGTTCTCT 
TTCTGTTGCT 12 0 

TTGTCAGTAA AAAAGACAGC CAAGATCTCA ACCATAAATA CTTATGCTGC 
TTGCCCGAGA 180 

AACTGGATTG GAGTTGGAAA TAAATGTTTT TATTTTAATG AAATACCAAG 
TAACTGGACA 240 

TTGAGCCAGA CCCTCTGTAA GGAACAAGGG GCCGAGCTAG CACGATTTGA 
CACCGAGGAG 3 0 0 

GAGCTGAATT TCCTAAGGAG ATACAAAGGG AGTTCAGGTT ACTGGTCCGG 
TCTGCACAGA 3 60 
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10 



15 



20 
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GAGTCATCAG CGCACCCTTG GAAGTGGACA GACAACACTC AGTATAACTA 
CTCGCTTTCC 42 0 

5 ATCCGGGGAG TGGAAAGATA TGCCTACCTG AACGACATCG GGATCAGCAG 
TGCCAGGGTC 480 

TATGCAGACA AAAGATGGAG CTGTAGCAAA CTTAACAGCT ATAGCCTCCA 
ATGCAAAACT 540 



CCTTTTTCTC CTATGTAGCT TTTGATCAAG AGAGATGCTT TTTAGTC TGC 
TAAAAAAAAA 600 

AAAAAAAAAA AAAAAAAAAA 620 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCGAATGTTT CCTGCAACAC AAAGATGACA ACCCCAGCCT GCCACCATTT 
GAAAGGCCAG 60 

30 

AGGCTGAGGC CATGTGCACC TTCCATTTCA TTTCTGATGT TAAGAAATAT 
TCTCTATCTG 12 0 

GTTTGATAGC ACTTTGGGAC CATAGGGGAA AGAGTAGCAC CCACAGATAA 
3 5 CAGGCTAAAA 180 
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AGCGTCTCTT GGTAAATGCT AGGAAGGAAA AAAAGGAGTT TGGCAGTGGA 
GGCTATAGCT 240 

GTTGAGCTTG CTACAGATCC ACATCCGAAG TGAATAGATC CTGGTACTGC 
TGATCCCGTT 30 0 

GTTGTTCAGG TAGGCAAATC TTTCCTCTCC CCGGATGGGA ATCGTGTTGT 
TATACTCAGT 3 60 

GTTGTCTGTC CACTTCCAAG GGTGCTCTTT GGCCTCTCAG CTTTCAAGTT 
TCAATCCTGT 42 0 

AGTGGAAACT CAGCTCCTCA GC TC TG AG AT GTGTGTCACA AAGGCTTCCC 
TACCTATGCT 480 

TAGTCCCACA GGCAGCCCGC AGGTAGAAGT GGGTAAAATT CTCCAAGGAA 
AAAGGCACGG 54 0 

AACCATCTCC CCTGAGTCTT GTGCTAAGCT TTACTGC TAC TATGGAGTGA 
TCATGGTCCT 600 

CACTGTAGCT GTAATTGGTC TTTCTGTTGC TTTGTCAGCA ACAAAGACAG 
AACAGATCCC 660 

AGTCAACAAG ACCTATGCTG CTTGCCCGCA AAACTGGATT GGAGTTGAAA 
ATAAATGTTT 72 0 

TTATTTTTCT GAATACCCAA GTAACTGGAC ATTCGCCCAG GCCTTCTGCA 
TGCGCACAGA 780 

GGCCCAACTA GCTCGGTTTG ACAACCAGGA TGAGCTGAAT TTCCTAATGA 
GATACAAGGC 840 

GAATTTTGAT TCCTGGATTG GCCTGCACAG AGAGTCGTCA GAGCACCCTT 
GGAAGTGGAC 900 
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AGACAACACT GAGTATAACA ACACGATTCC ATCCGGGGAG AGGAAAGATT 
TGCCTACCTG 9 60 

AACAACAACG GGATCAGGGA ATTCCGGGAC ACCCGTCAGC ATTCCTGGAG 
5 AAAATTCGGC 102 0 

ATTCATGAGA AAACTGTCTT TCTACTCCAG TGCTCTCAGT GACCAATGGC 
TACTGAGTGC 1080 

10 TGCTTCATCT GAACTGATCT GAATTGAGGC AAATGTAGGG TTGGCTTCCT 
GCAGGAAGAC 1140 

TGTTCAAAGC CAAGCTCTTT CCCTTCTAGG TGCCTGGGTC TAGTGCACAT 
TAGTCTTGTT 1200 

15 

GGCAGCGTGT CTCCTCAGTC TGGCTATTGT GATCTTTCCC ATAGAAAGAG 
TCAGGAACGA 12 60 

GGGGAAGGGA AAGATAGAGG CCTAAGGTGA AATTTTAAAA AACTCAATCT 
20 GTTGGTTTGA 1320 

TTTGTGGTTT CATGTTTGGG TGCAATTGTT CTTGAGACAA AAGTAGAACT 
TTGAAATACT 1380 

25 TTATTTAAAG AAACGAGTGC TCTGGCATTA TTAAATAAAC CTAATGTAAG 
TCTATGAAGA 1440 

GTTTCACTTA AATACATTTA TATAAAGAGC CAATGTTAAA AGTGTTATGG 
ATAATAATTC 1500 

30 

TTCAAGGTGG TGGTTGTATT GGAACAAGTG TTCTTTCTGT CAGCTAGATT 
CCTGGTATAA 1560 

AATAATTTGA CTGCAGGGAA GTTGACAGAA AGCATTACTT CTGTATGCTA 
35 CAACCCTTTA 162 0 
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AAATTGTGCT CTGCCTCCAC CCATGTGGTG GTTTGAATGA AAATGTGGCC 
ATAGTCTCAT 1680 

ATTTGGATGT TTAATCACTA GGGAATGGAC C TGTTTG ATA GGATTAGAAG 
GATTGGAGGC 1740 

GAGGCCTATT GGAGGAAGTG CCATACTGTG GATGGCCTTT GCCTAGTCTG 
TCAACCCCAG 1800 

AGTTTTCATG CCTGAGTGCT CCCTGCTGGA TAATGGAGTA ACCCTCTGAA 
ACTGTAAGCA 18 60 

AGCTCCTGAT TAAATGCTTT CATTTCTAAA AAAAAAAAAA AAAAAAG 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TGCATTACAC ACACACACAC ACACACACAC ACACAAGGCC GGGCAGTGGT 
GGTGCACACC 60 

ATTAATCCCA GC AC TGGGG A GGCAGAGACA GGCAGATTTC TGAGTTCAAG 
GCCAGCCTGG 12 0 

TCTACAGAGT GAGTTCCAGG ATATCCAGGG CTACACAGAG AAACCCTGTT 
TCAAAAAAGT 180 

TACTTTTTGT ACCTTGAAAT CTAAAATATG TCTCAACTCT GTTTGTTTCT 
TTTACAGTAT 240 
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AACATGCTCC CCCCCCCCCC CCGCCGCCGC AGTTTTTCAG TTCCAGATCT 
AGGTAGGCAC 3 00 

5 CCAATCTCTG GCAGCTTATC AAGTCAGCTG ATGTAAAAAT AATCCCACAA 
CTCACAAAAT 3 60 

ATAGAGGGAA GACAGCGGGG AAAAAGGGGC GGGCTCATTG CTTCAGCAAG 
AAGATAGTGG 420 



10 



TGCATAGCCT CCCATGCCAG ATTGCTTGGA GACAGGAGAA AAACTGTACG 
TATTTAATGA 480 

AATGCTAACT AAACTAAAGT GGGGGAGGCT TCCTCAGGGG AGCTGGATCT 
15 TGCTCCTGTT 540 

AGCCTGCCAT AGTGGGTCTA TATAGACCAG CTGAGGCTGG GGTCGGGTGG 
ATGGTGGGAG 60 0 

2 0 CTCTGCTGTG GTCGGAAAGT ACCGATGCCA CTCTGGCTTT CTGGTATGGC 
CAATGTTACT 660 

TAAATACGTT TGGGAGGAGT GCAACCTTTT GAGTTTGTAA ATAAAAGCAG 
GTGCCCAGAT 72 0 

5 

TCCTGGAGGA TTGACTGGAG GACCTTGGGG GTGCTCTGGC ACACCCTGCC 
ACCCAGCCCA 780 

TACC TTAAGT GCCCCTCCTA CACACCTACC TACAACTTTC TTTTCAGGCT 
0 CCCACAGTAC 840 

TCCCCCTTTC CCAAACCTCC AAGC TTTTGG AATTTCTCTC TCTTCCCAAG 
GACACGGTAT 900 MG 

CAGGTAATAC TCTTTCTGGC CTTAAATGAC TCTTGTGGCA CCAGGGAAGG 
ATCAGTTTTT 9 60 
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TTCCAGTAGG GTGGGGGTGG GAGATTTATC CCATCTACAA ATCCATCTAC 
AGTTTTAGTT 102 0 

CACTGGGTGC TGGGAATGAA CCAAGTCCTC TCTCTGCAAG AGCAGCAAGC 
5 TCCCTTCCCT 1080 

GTTGAGCCAT G AC TTTACCC CCACTTTAAT AC TTTTGTTT AGGAATAAAA 
TATCAATTTT 1140 

10 CTTGAAAAGC AGAGTTCACA ATTGTTGTTA GATCAATGGC CTAGTGGCAG 
CCTGAGGATA 12 00 

CCAGGCAAGC TCCTTCAGAG TGGACAGCCT AGCTGCTAAG ATGATTGGAA 
ATACTGTTCT 12 60 

15 

GGGAGGTGGG GGACAGGTCG AGGAAGAGGG AGACCTAACC ATGCCTCCCT 
TCAACCCTAG 132 0 

GGCCCTACTC CATGCCATCC TGTGCACACC TAAAGTACCC TCCTCCACGG 

2 0 CTATCCTGGT 1380 

CCCTTAAACA GACCCTTAAT CAGAGTGTAG AACAGGGTCT TCTTGAGGCA 
GAGTAGCAGG 1440 

25 TATGATTGGC CTGCTGCCTT TGACTGTGAG CTATAGCCAG GTTCCACCAA 
GTTCCATACT 1500 

CCTCACAGTA AGCCATAAGC GCCTGTTTGT GTTGGGAAAA CTTAGAAAAA 
GTAAAGATTT 1560 

30 

TCTTTTGTTC TTGAGACTTT TCTATGGGTT AAAAATGGCG AGCCAGGGTC 
CTACCAGCAG 162 0 

TGGGTCAGGG GGACATGAAA GGGACCTGAA TTTGGTGAAA AGGTACTGGA 

3 5 TGTGGTGTCT 168 0 
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CACAAGTGGG TCTTTTCTAG AAGCCCACCG CAGCTTTCTT TTTATGG TTT 
TAACCTCCCT 1740 

CTGGCTTAGT CCCCAAAGAG GAAAGGGCCG TGGAGGAAAT ACTAGTTGTG 
TTCTTGAAAA 1800 

AAAGCCATGT GC TTTTTGAG GGCAAGTAGT TTTAAAGGCA ACGCTTGAAG 
GGGCCCTTAA 1860 

AGAAAATCCG GGTAATGGCA GGAAGGTATT TTTATAAATT TTTGGTAAAG 
GAAAATAACC 192 0 

TATATGGCCC CAACTGGGTT GAGTCCCAGC GTGGTCACCT TGGCTGGGGA 
CAAGGGTGCT 1980 

TCAGCTCCCA TGGATGTGGG AACTGGGCTT TGGGTGCCAG GGCTGGGGTG 
GTGGAAGGTT 2040 

TTGGGGTATG GCCCAAGTCG GAGGCTCCCA ACCAAACAGG CATCCAGCAT 
CTTTCCAGTG 2100 

GATGAGTGTG GAAAACCTGT GATACATACT CCCATATATA CTGGAATACT 
ATGTACTAGT 2160 

AAGATAGGAT GTC TTTTGTG ACAACATGGC TGGACCTGGG GTGACATGCT 
GAGAC AAATT 222 0 

AGTCAGGCAC TGAAAGGCCA ACATTGTTCA TCAGTTGTAG AGGGGTTTGT 
TAGCTAAAAG 2280 

CAGACAGGAG TTTAC AC TCT TTTCTTCGAT TTGGAAAGAT TTTTGAAATC 
ACAGTGCAGA 2340 

ACCTGAAATC ACAATGAAAC CAAACCACTC C TTTAC AATC TGAAGGGGTT 
TAG AAATC TC 2400 
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CCAAGACTTC CTTTCTATAG GGAGTGTGAG GAGGGCTGAG GAGGGCTCCC 
AGCAGCACAT 2 460 

GGCTGAGAGG TGCTGGGGCT GGAAATGAGC ACAGGCGAAT TTATTATGCT 
5 ATC ATTTT AT 2 52 0 

ATTCTGTAGA ACTAGAAAGA ATTAAGGCTG GGAGTTCTGT GTGGATCCAA 
AATGC AAAAG 2 580 

10 CTCAGTGCTT AAAGCCTTCT TTCTAATCCT AAGGCTCCTT TCCCTCCTTG 
TTAATGTAAT 2 640 

AGAAGCTTTC TGGTATTTTA GGTGTGCGAA AATGCACAAA ATGCAAGGAT 
TAAAGTCAGT 2700 

15 

GAAAACTCTG TAAAAAC TAT AATTAGC AC T CAATAAAATT AATTCATTTG 
GTATACATTT 27 60 

CTGTGAATTT TGAAAACATA TAATCAGGTG TTCTTCATTA AGATACATAG 
2 0 GGGCTGGAGA 2 820 

— y 

CTTGGCTCAA CCACTGAGAG CATTTATTGC TCTTGCTGAG GACTGAGGTT 
TCACTCCCAG 2880 

2 5 CACACATATG GTGGCTCAAC ACCCACCCCT AATTCCAATT CCAGGGATCC 

AATATATTTT 2 940 

CTAAATTCCT CTAACAGTAA TCATGCATGT AGTACACTAC ATACATACAT 
AC ATACATTA 3 000 

30 

CATTCACACA TTCTTACATT TAGCTGACAA AGCACTCTTA AATG TAAAAT 
AAATAAGACT 3 0 60 

AAAACAGTCA TTTTAAAAAT ATATACAGAC CCCCTACCCT ACCTGTTTCC 

3 5 CCGTTGTCTG 312 0 
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CTGCAGACAC TCTCACCACT CCTCCGCCAC AGCCATGAGT AGTCACCTTT 
CCAGATGACT 3180 

TAAAATGGGT CCATGAAGCA GAGAAGTCCC ACAAGAGTTC TTTCAGCTTG 
5 TCACAGCAAT 3240 

GCCTTCTGCT CATCACTCAC AGTGCAGTGC CAATCAGTAG TGTGTCAGAA 
AC ATGC AC TG 33 00 

10 CTGGTGAGAT GCTGAGGGAT CATACCCATA GCATCGCCTA CACAGAATCA 
TGCTCTGAGT 33 60 

TCAGAAATTT TTAAGAATCT CACCAGCAAA TACTATGCAA AGAGGTTGTG 
AAAAGCTGTC 342 0 

15 

AGGAAACTTC TAGAGAAGTG ATAGGAGGAA GTGAATAGTG GCAGTTGGGG 
GTCTCTTCAC 3480 

AAAGGAAACT GGGACTTCCT GTAGCTCTCT GACCTTTGCA TGAGCTTACT 
2 0 TTCGGGTTAG 3540 

TTTAGGGACA CTTTGGGGAA GAAGCCCTTG GGACATTTGG CCTGTTAAAG 
TGGCATGAGA 3 600 

2 5 TAAGGCAAGC ACAGGCATGT GTTCCAAGTT GTTTCTTGTG TTGAGAGGTT 
TACCTTGTCA 3 660 

TCAGCTTGGG GATATTTTAA TGGTCACAAA TGTGTCATTT TCACAGGGAT 
CCTTAAAGCT 372 0 

30 

GCTGCAAATA TTCACATAAA GATGTCTTGC AACTTGAATT CCTTTCCAGC 
ATGGGAATAT 3780 

GTGGGTAGGA TGGGAGCATA TCACATTTTA CACTTACAAA CAGCTTTGTA 
3 5 GAAGCTGTAA 3 840 
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AATTTAGCCT TAAGAAGTTG TTAGTTCTAC CTCAACATGG ACATCCACAT 
CAATGTATAA 3 9 00 

CCATCCTGTT ATGC AG AC AG TGATTTTGCT CTTAAATCGA AGATGATTTT 
5 GCCCAAGACA 3 9 60 

AGTTCACAAA CATTCCCTTA C TTTTC T AAA AATCAAATGA CTTTATGATA 
TTAAGTTTTG 4 02 0 

10 TGCTTGGGAT CTCTATGTCT ACAAATGGAC TGTAGAAATT T ATGC C T ATT 
TATTTATTTA 408 0 

TTTATTTTTT TAGGAGACAG ATGGAAGGGT GTTTCAGTAG CACACACGGG 
GAGTGATGCC 4140 

15 

ATCTTTTCCC TGTCTAAAGA CTGGTTCATC TCTGGTTAAG TGGTCTCTTG 
ACCACCCACA 42 00 

TGTGTTTGAC CTCATTGTGG AGTCCTGTTT TCTGCTGTGT TGTTTCAGTG 

2 0 TCTACTCTGA 42 60 

- 

TGCTAGCACC AGGCTTTCAT TCCTGTTCCC ATGAGAACAT GAGACATCAG 
GTCAGGCCTT 432 0 

25 GTGATCTTTC TGATTTTGAC CTCTTCATTC TCAGAATAAT TTTTG AC TAT 
TAATTTTTG A 43 80 

CTCTTTTTAA TTTTCATATT ACTTCTCATA CAACTTGGTG CTATGATTTT 
TTTTTTTC TG 4440 

30 

AAGGCAACTA CATCTCTGGA ATATGTTACA CATATATGTG TGTTCAGAAA 
TTGTTGACAT 4500 

GAGCAATATG GAGTGTTCTA GTTCATTATG TGTTTAGCGC TGCCTTGATT 

3 5 TC TTTC TTT A 4560 
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TTTCAAAAAT GTAACATGTA CGAGTGCTTC GCCTGCATAG ATATCTATGG 
ATTGTGTGC A 462 0 

TTCC TAG TGC CCACCAAGGT TAGGAAAGTT CTCAGACTTC CTGAATATGG 
5 ATTTACAGAG 4680 

GCTTGTGAGC TCTTATATTG GTTC TAGATT TAACCCAGGT CCTCTGAAAG 
ATCAACAAAT 47 40 

10 GTTCGAAACC ACTGAGCCAA CTTTCCTTTT TATTTCTTTA TATATTTACA 
TGTAGTTTTT 4800 

GTTATGCTGA CTATGAACAT CCTATTTTTA GATTTGTAAT TTTATGTTTT 
TTTGGCATTA 4860 

15 

CTGTCTAAGA TATTAATTTC C AC TTG AC AG GACAATACAA CTGATTTTAC 
CATGCTGTCA 492 0 

CCCTGTCCTG CAGTTTTATA AACTCATTTT TTTTTTTTTT TTTACATCCA 
20 ACGGTCTTTT 4980 

GTAGATTTGT TTGGATTTTT ATGTAGATAG AACCCTGTCA TCTGTGAAAA 
GAAACACTTT 5040 

25 CACCTGTTTC TGAGAAGCAT AAACCTCCTT TCGTTCTTTA AGAGACTCAT 
TTTATGTATG 5100 

TGGCTGTTTG CCTGCAGGTT TGTACATAAC ATGAATGCCC AGTGTGTACA 
GAAGCCAGAA 5160 

30 

GAGGGCAATG GACCCCTTGG AACTGGAGGT GTGCAATGTT GTAAGCTACC 
TTC AATTCAA 522 0 

GTCCTCTGGA AGAGCTGAAA GAGATCTTAA CAGCTGAGCC ATTCTCCAGA 
3 5 CACCTGACCC 5280 
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TATTTCTTTG TCCTGGCGAG CACCTCCTGG G AAAAGTC T A CCTGGAGTAA 
TGAGC AGAC A 53 40 

TCTGACTCTT GCTCCTGATT GCTGGGAACA CATTCATAAT GGCACCATTG 
5 AG AGTGC AGC 54 0 0 

TGACTGAAGT TGGACTTACT GCTTTGTTTT TAATGGATGA ATTTTGTAGG 
GTTGAAGAAG 5460 

10 TTCCTTTCAC TTCCATTTTC CACATATTTT TGTTGTTTAT TGAGATATTT 
CTAATTTTTC 552 0 

CTCTTTTCTT ATTTTGTAAT ACATCTAATT ACATCAATTT ATTTTC TAGT 
GTTTCTACAA 5580 

15 

CTCCTTCTAG CCAGGGCGCT TTTTATGGAG CCAAAACCAG ATTTTTGTTG 
TTTGCATGTC 5 640 

ACTGGGATCT CCACTCCGTC CATTTTTGCT CTTCCATTAT TCACCCTGAG 
2 0 TTCAGTGATC 57 00 

-*■' s 

AGAAGTCTCC CTGGCAGAAG TTCTGTTTCC ATCCTCTCTA GACTTCTTTC 
TACCATGGAC 57 60 

2 5 ATCC TTTGTG GACCAGGGCC TTCTCTAAAA GGGTTGGCAG AAGCCTTTCA 

GG AAC TG ATA 5820 

GGCAATTGTC AAGTGGGTTT TGTGGTTATT TTATTTTATT TTATTTAAGA 
AATTCTTGAA 5880 

30 

ATTCGCATTC TTATATTCAT ATTTTACAGA TAAAATTCTC CAAAGAAAAA 
GTCTCAGAGC 5940 

CATCTCCCCT GAGTCTTCTG CTAAGCTTTA CTGCTGCTAT GGAGTGATCA 

3 5 TGGTCCTCAC 60 0 0 
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TGTAGCTGTA GTTGCTCTTT CTGTTGCTTT GTCAGGTAAG TGACATACCC 
TCCAAATTCT 6060 

GTGACACTCT GTCCATATTC ACATTGCCAG TTATGCTTTC TAAGCACTGT 
5 GATCCAGGCA 612 0 

CTGTGGCAAG GGC TC TAG AG G AAAC AC AC T GGAAGGTCCT GTTCTCTGAG 
AATTTAGGTT 6180 

10 CCAACAGGAA GATGCAGTGA AGGAACACAG AGGCTTTGAT GGGGACATCC 
CCGGGAAGAT 6240 

GACATCCAGC AAGCTCTAGA CAGAGATGCA GGGGACATAG GTCCCCTTGG 
GGGAACAATT 6300 

15 

CAAGGCAGAG AATAACAAGA GGGAATCTCC AAGTAGAAAC TTCAAAGGTG 
AGGCCAGGAA 63 60 

GGTACAGTGC CTTTGACCAT GACCCATGAG TTTAGATACC AGGCTCAACT 
20 CTATTTTGAA 642 0 

AGTATTAAAT GGAAAGTTCC TGAAGTAAGA AATTTATAGG ATTTTAGTAC 
CACAATATTC 6480 

25 AGAATAGTGC AATAC AATC T TGC AC TGTCC TCTTAAGTAT TTGAAGTCAT 
CCTTTAGTGC 6540 

AAGGTGTCTG CACCGTATAT ACGACCTACC CAAAAATTCT CATAGAAATC 
TCAATTATCA 6600 

30 

GGCTGGGTGT CAGTAGGTGT CCCTACAGAG TGCTTGCTGC TGTAGCAATC 
CCCACTGTAG 6660 

TCAATGGTCA TCCAAAGCTC AGAAAGTGAT GCTGTTATAG AAGTGCACTC 
3 5 CCTGGGAGCC 672 0 
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CTACTGACAG TGAGCACCTG AGAGAGAATG GGACACAGGC CCACGGTGGG 
AGGCCTTTAG 6780 

TTAAAGGCAC ATCTCGATCA GGAGAGGATT CC TAC AG ATC AG TTAGG AAA 
5 GCTACCATCA 6840 

GATTCACACC TCACAGCTGA GCTCAGGAGA GTGTGGCAAA ACGAGAGAAG 
ACCTGCTTGC 690 0 

0 TATGATCCAT CATATTCTCT ACATTTTAGT AACAAAGACA G AAC AG ATC C 
TAATCAACAA 6960 

GACCTATGCT GCTTGCCCGA AAAAC TGG AT TGGAGTTGGA AATAAATGTT 
TTTATTTTTC 7 02 0 

TGAATACACA AGTAACTGGA CATTTGCCCA GACCTTCTGC ATGGCACAAG 
AGGCCCAACT 7080 

AGCTCGGTTT GACAACGAGA AGGAGC TGGT AAGCAATGGG CAGGGATTGG 
TTTGTCTGTC 7140 

TGTTCTGTTG AATATTATAT TGCC TTGAGA TAGAGAGTTA CAGATGAGGC 
CCGAGGAAGG 72 00 

GATCCCACCC AAGCACATGG AGACATAGGG AATGTGAGTG TGTGCCATTT 
GCTGATGCTT 72 60 

GACTTCTGAC TGGAGCCCTG AGATAGTCAA GAAACATTCT CTCATGAAGT 
GCTCATAGTC 732 0 

AGCTGGAAGG TCAAATATGC CATTTTACTG GGATACCTGG TGACCATGAG 
TGTTTTCCCA 7380 

TATGC TGGC A TATGTTGGGT ACAGAAGGAG ACAACTGATA ATAACTGCAG 
TGGAAGGTTA 7440 
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ACCCAGAACT GTCCAAACCA CAGAGGAATG TGACCCTCAG TTACATCCTC 
CTGTTATCTC 7500 

TAGAGAAAGG TGTGGAGTGG AGAGACTCCA GGATCATCTG AAACAAATAG 
5 ACAC ATGTAT 7560 

TCTTGACTTT TTTGTGTTTT ATGACAGAAT TTCC TAATG A GATACAAGGC 
AAATTTTGAT 7 62 0 

0 TCCTGGATTG GACTGCACAG AGAGTCGTCA GAGCACCCTT GGAAGTGGAC 
AGACAACACT 7 680 

GAGTATAACA ACATGTATGT TTTCACGATG TTTTTCCTTC TATTATGTTC 
ATGTGTTGTG 7740 

ATATGTGTGT GTCGTGGCTA TGAGAGATGG AAGTCAATGT CATGTGAAGC 
CAACTGTACT 7800 

GGGAAGAAAG AAAAAAAAAT GAACCCTTGC CTGGAGGTGT GGCTCAGGGG 
) TAGAGAGTGT 7860 

GTATAAATGC AATATCCAAT CCCCAGAAAG CTCTACACAC CCACAAATTT 
AAATACTTCA 792 0 

GAGGTTTTCC TGTTTATTGA CCGTCATTTT CAAAACCTTT GCATCATGTC 
ATTTTACTCA 7 980 

AAATATTTAC CATAATGATG GTGTCTGAGA GTAGC TATTG TTTGCTCTGG 
CTCCAACTTA 8040 

AACATTTCTG TTGTTGATAA ATGTCCTGTG AGGGATATAG ACAGAGCCTT 
AGATGGGCAG 810 0 

TGGGGGCTCT GGAATCCCAG AAAGCCACTG CAGTATCTGC AAGCCTGAGA 
TTCAGCTTTC 8160 
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CACTATTTGC ATGTCTGCAC CTGTTCAGGA AAGCAGAGAC TCTAAGTACA 
TTTGG AACC T 8 2 2 0 

CCTCTAAAGT CTCGTCATCA CTGAGCACCC AAAACAGTCT TGGGTTTGAG 
5 CTGTTTTACT 8280 

GGGATGGTAA ATCACAGACT CAGTCACATC CATCACTGAA GCCCTTAGAG 
CAATTTACTA 8340 

10 AGTGGGCGTC CCCATATATA AAATGCCTAA AACAGAATTG AAAATCACCC 
TTGGTGGGGT 8400 

CACTCATGGC TGCAGTTCAT TTGAACATGG CAGCGAGCAC CAGCCCAATG 
CCTTGTACAC 84 60 

15 

ACATTACAGG ATTCACCATG GACAAATGAC AAAGGAGTGG TGTTCAAATC 
CTGAGAATAT 852 0 

GAGACAGTAG GTGTAAAACT AATGCAGGTG ATTCCTCAGG G AC TTTTTG A 

2 0 TTCATATTAC 8580 

' s 

CAAAAATTAG TGG AG AC TGG TGAGATTTCA TTGCAGGAGC AAATGCAGTT 
CTGGGCTCTG 8640 

25 TAGGCTTACT TTTTTGGTTT CTTTTCAGGA TTCCCATCCA GGGAGTGGAA 
ACATGTGCCT 87 00 

ACCTGAGCGG CAATGGGATC AGCAGTTCCA GGCACTATAT ACCTCGGATA 
TGGATCTGTA 87 60 

30 

GCAAGCTTAA CAACTATAGC CTCCACTGCC CAACTCCTGT TCCTGTCTAG 
CATTTACCAA 882 0 

GAGACTCTTC CTAGCCTGTT ATCTATGGGT GCTACTTTTT CCCCTATGGT 

3 5 CCCACAGTGC 8880 
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TATCAAACGG GATTGAGAAT ATTTTTTAAC GTCGCAAATG AAAACCATCA 
AGGCTGGAGA 8940 

GATTGCTCCG TAG TT AAG AG ACTGACTGCT CTTCTGCATG TCCCGAGTTC 
5 AC ATC TGAGC 9000 

AACCACATGG TGTCTTACAA ACATCTGTAA TGACATCTTA TGTCCTCTTC 
TGTGGTGTGT 9060 

0 GAAAACAGCT AC AC TATACC TACATATGAT AAATAAGTAA ATC TTAAAAA 
AGAAAAAG AA 912 0 



AACCACCTTA GAGAGGTGCA CACATGGAGG ATTACAAGAC CATAGATGAG 
TTTTAAATAG 9180 

5 

ATGTCAGCAC TCATACCTTA AGCCTAAAGT ACAACTAATG TTAGGGAACC 
CCACTTTTAT 9240 

GATATTAAGG TTTTGTGCAG AG AATTC TTC TTTTGAATTT ATGAGACCAC 
) AAAAATGAGT 93 00 

CCCCCAACAT GGGTGTAACC TTTAATAATG AAAGCAGAAT GGCTGGGAT 9349 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATAGTGGTG CAGAGCCTCC CATGCCAGAT TGCTTGGAGA CAGGAGAAAA 
ACTGTTTGTA 60 
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CATAACATGA ATGCCCAGTG TGTACAGAAG CCAGAAGAGG GCAATGGACC 
CCTTGGAACT 12 0 

5 GGAGATAAAA TTCTCCAAAG AAAAAGTCTC AGAGCCATCT CCCCTGAGTC 
TTCTGCTAAG 180 

CTTTACTGCT GCTATGGAGT GATCATGGTC CTCACTGTAG CTGTAGTTGC 
TCTTTCTGTT 240 

10 

GCTTTGTCAG TAACAAAGAC AGAACAGATC CTAATCAACA AGACC TATGC 
TGCTTGCCCG 30 0 

AAAAACTGGA TTGGAGTTGG AAATAAATGT TTTTATTTTT CTGAATACAC 
1 5 AAGTAAC TGG 3 60 

ACATTTGCCC AGACCTTCTG CATGGCACAA GAGGCCCAAC TAGCTCGGTT 
TGACAACGAG 42 0 

2 0 AAGGAGC TG A ATTTCCTAAT GAGATACAAG GCAAATTTTG ATTCC TGGAT 

TGG AC TGC AC 480 

AGAGAGTCGT CAGAGCACCC TTGGAAGTGG ACAGACAACA C TG AGTATAA 
CAACATGATT 540 

25 

CCCATCCAGG GAGTGGAAAC ATGTGCCTAC CTGAGCGGCA ATGGGATCAG 
CAGTTCCAGG 600 

CACTATATAC CTCGGATATG GATCTGTAGC AAGCTTAACA ACTATAGCCT 

3 0 CCACTGCCCA 66 0 

ACTCCTGTTC CTGTCTAGCA TTTACCAAGA GACTCTTCCT AGCC TGTTAT 
CTATGGGTGC 72 0 

3 5 TACTTTTTCC CCTATGGTCC CACAGTGCTA TCAAACGGGA TTGAGAATAT 
TTTTTAACGT 7 80 
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CGCAAATGAA AACCATCAAG GCTGGAGAGA TTGCTCCGTA GTTAAGAGAC 
TGACTGCTCT 840 

TCTGCATGTC CCGAGTTCAC ATCTGAGCAA CCACATGGTG TCTTACAAAC 
5 ATC TGTAATG 900 

ACATCTTATG TCCTCTTCTG TGGTGTGTGA AAACAGCTAC ACTATACCTA 
C ATATGATAA 960 

0 ATAAGTAAAT CTTAAAAAAA AAAAAAAAAA 990 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCCCATGCCA GATTGCTTG 19 

(2) INFORMATION FOR SEQ ID NO : 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14: 
GGGACCATAG GGGAAAAAGT AG 22 



H : \PClsrlc.\K. e p\specis\ST VINCENTS osteoclast .doc JO/06/99 



- 28 - 



(2) INFORMATION FOR SEQ ID NO : 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCCCATGCCA GATTGCTTGG AGACAGGAGA AAAACTGTTT GTACATAACA 
TGAATGCCCA 60 

GTGTGTACAG AAGCCAGAAG AGGGCAATGG ACCCCTTGGA ACTGGAGGTA 
AAATTGTCCA 12 0 

AGGAAAATGT TTCAGAATCA TCTCCACTGT GTCTCCTGTT AAACTTTACT 
GCTGCTATGG 180 

AGTGATCATG GTCCTCACTG TAGC TGTAAT TGCTCTTTCT GTTGCTTTGT 
CAACAAAAAA 240 

GACAGAACAG ATCATAATCA ACAAGACCTA TGCTGCTTGC TCAAAAAACT 
GGACTGGAGT 3 00 

TGGAAATAAA TGTTTTTATT TTTC TGGATA CCCACGTAAC TGGACATTTG 
CCCAGGCCTT 3 60 

CTGCATGGCA CAAGAGGCCC AACTAGCTCG GTTTGACAAC GAGGAGGAGC 
TG ATTTTCC T 42 0 

AAAGAGATTC AAGGGGGATT TTGATTGCTG GATTGGCCTG CACAGAGAGT 
CGTCAGAGCA 480 

CCCTTGGAAG TGGACAAACA ACACTGAGTA TAACAACATG AATCCCATCC 
TAGGAGTGGG 540 
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AAGATATGCC TACCTGAGCA GCGATAGGAT CAGCAGTTCG AGGAGCTATA 
TAAATCGGAT 600 



GTGGATCTGT AGCAAGCTCA AC AAC TATAA CCTTCATTGC CAAACTCCTC 
CTGTCTAGCA 660 

CTTACCAAGA GACTCTTCTT AGCCTGTTAT CTATGGGTGC TACTTTTTCC 
CC TATGGTCC 72 0 



(2) INFORMATION FOR SEQ ID NO : 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16: 
TGGAAACTCA GCTCCTCAGC TCTG 24 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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TGGAAACTCA GCTCCTCAGC TCTGAGATGT GTGTCACAAA GGCTTCCCTA 
CCTATGCTTA 6 0 

5 GTCCCACAGG CAGCCCGCAG GAGGTAGAAG TGGGTAAAAT TCTCCAAGGA 
AAAAGGCACG 12 0 

GAACCATCTC CCCTGAGTCT TGTGCTAAGC TTTACTGCTA CTATGGAGTG 
ATCATGGTCC 180 

10 

TCACTGTAGC TGTAATTGCT CTTTCTGTTG CTTTGTCAGC AACAAAGACA 
GAACAGATCC 240 

CAGTCAACAA GACCTATGCT GCTTGCCCGC AAAACTGGAT TGGAGTTGAA 
15 AATAAATGTT 3 00 

TTTATTTTTC TGAATACCCA AGTAAC TGG A CATTCGCCCA GGCCTTCTGC 
ATGGCACAAG 3 60 

20 AGGCCCAACT AGCTCGGTTT GACAACCAGG ATGAGCTGAA TTTCC TAATG 
AGATACAAGG 42 0 

CGAATTTTGA TTCC TGGATT GGCCTGCACA GAGAGTCGTC AGAGCACCCT 
TGGAAGTGGA 480 

25 

CAGACAACAC TGAGTATAAC AACACGATTC CCATCCGGGG AGAGGAAAGA 
TTTGCCTACC 540 

TGAACAACAA CGGGATCAGC AGTACCAGGA TCTATTCACT TCGGATGTGG 
3 0 ATCTGTAGCA 600 

AGCTCAACAG CTATAGCCTC CACTGCCAAA CTCCTTTTTT TCC TTCC TAG 
CATTTACCAA 660 

3 5 GAGACGCTTT TTAGCCTGTT ATCTGTGGGT GCTACTTTTT CCCCTATGGT CCC 
713 
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(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TTTGTCAGCA ACAAAGACAG AACAG 2 5 

15 (2) INFORMATION FOR SEQ ID NO : 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 

CGGGTGGGCG CGAGAGAGCC TAGAAGCCCA TGTAGCCGCG AATCCCGCAG 
CCCCAGTACA 60 

CCTCCCTCCG TGCCTCCCCG CCTTTTCTGC AGAGCTCCGC CCTGGAGTGA 
3 0 AGGAGGAGCC 12 0 

GTCACCTGGA GCTCCGAAAA AAGCAGAAGA AGGCGC TTTT TATTTAGCCA 
GTGTGACCCC 180 

3 5 GCCAGGGCCT TCTCGGTTGG GTGAGCACTC TCTCTGACCA GGCCATGAAA 
AG AAAAATC T 240 
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GTGCGATGCC TCCCCACATG TCACGGGACT CTGACTTGCC TTTGTCGTCA 
GAGTTTGCAG 30 0 

AACTTTGGGG GACC TG AG AG GGGAGTGCCC CCTGGACGGG CCACGGCTGT 
CTGTGGCTTA 3 60 

AGGGCTTTTG GAAGGGCGGA GAGAGGGAAA CGGCGTCCTA GTGGCCTGCT 
TCAGGGCCAC 42 0 

CCACGGGCCC TCCCCCAACC TCTCTCTGAT CCAACTTGTT TTTCCAGCCT 
AGTTGGAAAC 480 

TTGTGGATGC TGTGACCTCA AG AAG AC TTG GCATTTTATT TGGAAGATAG 
AC ATC TATTT 540 

GCAACTGGTC CTGAGCCCCT ATTTTCCTCC CACCTTTCTT GGGGAAACTT 
GTTTTTAAGG 600 

GGTGCCACTG TTTTTGTAAC ATGTTGCTCC TAGCTCTTAG CATTCATGGT 
ACTGTTGTAA 660 

ATGGAGAAAG AGTAATTCAC GCAGAGCCGG C TTTGC AG AT AAAACTCTGC 
AAAGACAATG 72 0 

TCTAGCACTT ATATCTCCAG TAACTCCTGT CAAGGTTTAT TGTTGC TTGT 
TCATCCATTG 780 

CAGTCCTGAC TACAAGTGTG ATTGC AC TTT CTATTGTTTT GTCAGAATTT 
CCTGAAAAGA 840 

TACAAAGGCC CTTCTGACCA TTGGATTGGC C TT AG AAG AG AATCATCCCA 
TCGCATTTGG 9 00 

AAATGGACAG ACAACATGGA ATATAATAAC ATGCTTGCTA TCAGAGGAAG 
TGGAGAATGT 9 60 
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GCCTTCCTGA ATGACAATGG AGTCAACAGT GGCAGAATCT ACATGAACAG 
AAAATGGATT 1020 

TGTAGCAAGC CAAACAATTA TGTCTACAGT TGCCAGTTAT GTCCCCACTG 
5 GGATACTACC 1080 

TAGTAGAGCT GTGAGAAGAG GGCCACCATC CTCCAGACTC CAGAATGGTG 
GAATCATCAG 1140 

0 CAGCTTCCAC CATGCCCCTG GAAAAACTGC AAGTAACAGA CCTGCACATG 
TATCCCCTAC 12 00 

ATCTAAAAAA AAAAAAAAAA AAAAAAAAA 122 9 

5 

(2) INFORMATION FOR SEQ ID NO : 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 6 base pairs 

(B) TYPE: nucleic acid 

3 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGGGACAATG TTATGTGGCT CAGAGGCCCT CCATGTATTC TTAAC TATTC 
ACTCTCCTAT 60 

CCTTCCAAGA ATAAC AC TAA TTTGACCTCT ACAATAATCA CTTTATCACT 
CCAGTTTTGC 12 0 

CTTTTTTCCT CCAAAACAAT GCCTTTTAAG TCTATTTTAA TCGATAGATT 
TCCTCTTAAT 180 

ATCATTTAAA AATATTTC TT TACATTTTTA GACAGGAATC AGAATAATTT 
GCTATGTTGA "240 
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ATTTCCAGTT ACTTGGATTT TGTTGATTTC ATTCCTGTGG TTTAGTTGAC 
ATGAATCTCT 3 00 

5 C C AATTG AAA GGGTAACTTG AATATGGTAG CTGGAAAGTT AAAATCAATT 
C TTTTAACTT 3 60 

TGGAAAATGA TTAAATTCTG GAGATAGAAT AGGTAAGGTT CATAAGATGA 
C AGGGTCGTT 420 

10 

TGCATCCTTC TAGTGGAAAA GCGAAGGGAA TTAAATAAAA ATAACACTTT 
GATGCTTAAT 480 

GTTTCTGGCA GTATTATGTC TGTATTTAAT TGTTAAAATG TTTTTCAATA 
1 5 ATTTTTTCCA 540 

AGCGGGTTGT CTGCATTCAA AAGAGCATTC TATTAAAGCT ACCTTAATTT 
GGCGCTTATT 60 0 

2 0 TTTC TTAATC ATGTTTCTGA CAATCATAGT GTGTGGAATG GTTGCTGCTT 

TAAGTGCAAT 660 

- / 

AAGAGCTAAC TGCCATCAAG AGCCATCAGT ATGTCTTCAA GCTGCATGCC 
CAGAAAGCTG 72 0 

25 

GATTGGTTTT CAAAGAAAGT GTTTCTATTT TTCTGATGAC ACCAAGAACT 
GGACATCAAG 780 

TCAGAGGTTT TG TG AC TC AC AAGATGCTGA TCTTGCTCAG GTTGAAAGCT 

3 0 TCCAGGAACT 840 

GAATTTCCTG TTGAGATATA AAGGCCCATC TGATCACTGG ATTGGGC TG A 
GCAGAGAACA 90 0 

3 5 AGGCCAACCA TGGAAATGGA TAAATGGTAC TGAATGGACA AGACAGTTTC 
CTATCCTGGG 9 60 
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AGCAGGAGAG TGTGCCTATT TGAATGACAA AGGTGCCAGT AGTGCCAGGC 
ACTACACAGA 102 0 

GAGGAAGTGG ATTTG TTCC A AATCAGATAT ACATGTCTAG ATGTTACAGC 
5 AAAGCCCCAA 10 80 

CTAATCTTTA GAAGCATATT GG AAC TG ATA ACTCCATTTT AAAATGAGCA 
AAGAATTTAT 1140 

10 TTC TTATACC AACAGGTATA TGAAAATATG CTCAATATCA CTAATAACTG 
GGAAAATAC A 12 00 

AATCAAAATC ATAGTAAAAT ATTACCTGTT TTCATGGTGC TAATATTACC 
TGTTCTCCCA 12 60 

15 

CTGCTAATGA CATACCCGAG ACTGAGTAAT TTATAAATAA AAGAGATTTA 
ATTGAAAAAA 132 0 

AAAAAAAAAA AAAAAA 1336 

20 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10221 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
30 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GAATTCCTTC TTTTTC TATT GTTTGGAATA ATTTCAGAAG GAATGGTACC 
AGCTCCTCTT 60 

35 

CGTACCTCTG GTAGAATTCG GCAGTGCATT TGTCTGGACA TGGGCTTGTT 
TTGGTTGGGT 12 0 
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AGGC TATTAA TTACTGCCTC AGTTTCAGAA CCTGTTATTG GTCTATTCAG 
GAATCTGATT 180 

5 TCTTCCTGGT TTAGTCTTGG GAGGGTGTAT GTGTCCAGGA ATTTATCCAT 
TTCTTCCTTG 240 

CCTGGGTATC ACCAGCAAAG GCTGAAGAAA AGCAAAGATT GCTGCCTGCT 
CCTTCCTCTG 3 00 

10 

GAAGCTTCAT CCCAGAGGGG CACGCACCAG ATGCCAGCTG AGCTGTCCTG 
TATGAGGTGC 3 60 

CTATCAACCC CTGCTAGGAG TTGTCTCCCA GTCAGGAGGC ATGGGGGTCA 
15 GGGACCCACT 420 

TGAAAAGGCA GTCTTTCCCT CAGAAGAGCT CGAGCACTGT GC TGGGAG AT 
CCACTGCTCT 480 

2 0 TTTCAGAGCT GGCTGGCAGG AATGTTTAAG TCTCC TGAAG CCGTGACCAC 

AGCCACCCTT 540 ,. y 

TCCCCCAGGT GCTCTGTCCC AGGGAGATAA GAGTTTTATC TATAAGCCCC 
TGACTGGGGC 600 

25 

TGCTGCCTTT CTTTCAGAGA TGCCCTGCCC AGAGAGGAGG AATC TAG AG A 
GGCAGTCCGG 660 

CTGCAGTGGC TTTGCTGCAC TGGCTTTGCT GCACTGTGGT GGGCTCCGCC 

3 0 CAGTCCGAAC 72 0 

TTCCCCCAGG GCTTTGTTTA CACTGTGAGG GGAAAATCAC CTACTCAAGC 
CTCAGTAATG 78 0 

3 5 GCGGATGCAC CTCCCCTCAC CAAGCTTGAG CATCTGGGGT CCACTTCAGA 
CTGCTGTGCT 84 0 
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GGCAGCAAGA ATTTCCAGCC AGTGGGTCTT AGCTTGCTGG GCTCTGTGGG 
GATTGGACCC 900 

AC TGAGC AAG ACCACTTGGC TCCCTGGCTT CAGCCCCCTT TCCAGCAGAG 
5 TGAATGATTC 9 60 

TGTCTCAGTG GGTTCCAGGC TCCACTGGGG TATGAAAAAA ACTCCTGCAG 
TTATCTTGGT 1020 

10 GACTGCCCAA ATCGCCACCC AGTTTTGTGC TTGAAACCCA GGGTTCTGGT 
AGTGTTGGCA 1080 

CTCCAGAGAA TCTCCTGGTC TGTGGGTTGC AAAAACCGTG GGAAAAGCGT 
AGTATCTGGG 1140 

15 

CCAGATAGCA CCTCACAGCA CAGTCCCTCA CAACTTCCCT TGGCTAGGGG 
AGGGAGTTCT 12 00 

CCCACCCCTT GTGCTTCCTG GGTGAAGCAG CGCCCCACCC TGCTTCTGCT 
2 0 TGCCCTCTGT 12 60 

GGGCTGCACC CACTTGTGTA ACCAGTCCCA GTGAGATGAT CCTGGTACCT 
CAGTTGGAAA 1320 

25 TGCAGAAATC ACCTGCCTTC TGCATTGGTC TCACTGGGAA CTGCAGACCA 
GAGCTGTTTC 13 80 

TAATCAGCCA TCTTGCCCTC TCTGGTCTGG TCGTTTTCTT TAAATTGGTT 
GATACAGGAG 1440 

30 

CAGTGATAGC ACAACAAATA TGCACAGATT TGGGGAAAGT CATCCTGCAT 
TATGGTCTGG 150 0 

TTCAAGAAAT TACATTTTAA TAGTTATAAT TTGGTATCAC CTTGTTTGTG 
3 5 ATACCAAACC 1560 
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AGATACAATA CACGTTTGCC TCATGTTATG ATTTGTTATT CAGATTACAC 
CAGTTATTAT 162 0 

TCATAACTAA GAGTGATTTC TCATCTCACA AGAGCCAAAT CCAAGGATAA 
5 TGGTGCCAAT 1680 

TGATAGTAAT GATTCTATGA ATACCCAGCA TTCTGGTCTA TCATAGACAC 
TTTCAGAACC 174 0 

10 ATTGAGTTGA AGGTAGAAGG TGGTTATATA ATAGAAGATG AACTGGTAGC 
TACTAGGGGC 180 0 

TCAGTGACAA CTATCTGGAG AGACATTCAT TCATCTCGAT CCCACATGAA 
GAGAGCATTT 1860 

15 

C TCCTGATTA TATAAGAAGT GGGTCAGAAA AGCCTGTCCA GTGAAGTATT 
GCTGCCTTCA 1920 

AAGTGTAGAA AACCTCACTA AATCTCCTTA GTGGAAGGAA GTTCACTGTA 
2 0 CAACAACTTA 1980 

TTTCATATTT ATGATAGTAT TTAGACATAT ACAAGGCTTT TTCACATCAA 
GAAACCTTAT 2 040 

25 TCACATAAGG CATCTCTATC CTGCCCTTCA TTTTACCAAG TCATCTGGAG 
CAGCAATCGC 2100 

CAACCTTGTT GGCATGAGGG ACCAGTTTTG TGAAAGACAA CGTTTTCATG 
GACTGGGGTC 2160 

30 

AAGGAATGGT TTGGGGATAA TTTAAGTGCA TTACTTTTAT TGTGCCCTTT 
ATTTCCATTA 22 20 

TTATTACATT GTGTAATAAT ATATAATAAA ATAATTATAC AACTCACCAT 
3 5 AATG TAG AAT 22 80 
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CAGTGGGAAC CCTGAGCTAG TTTTTCTGAA AGTAGATGGT ACCATCTGTG 
AGTGATGGGA 23 40 

GACAGTGACA GTTCATCAGG TATTAGATTC TCACAAGGAG CCCACAACCT 
5 AGATTCCTCA 2400 

CATGAGAAGT TCCCAATAGG GTTTGCCCTC CTATGAGAAT CTAATGCCAC 
TGCTGATCTG 2460 

0 ACAGGAGGTG GAGCTCATGA GGTAATGTGA GTGATGGGGA GTGGCTGTAA 
ATAC AGATG A 2520 

AGCTTCACTT ACTCATTCGC TGCTTACCTC CTGCTGTGCA GCCTGCTTCC 
TGACTCATCC 2580 

ATGGACCAGT ACTGATCCAT GGCCTAGGGG TTGGGGACCC CTAATCTAGA 
GCACTTGGAG 2 640 

AACTATCTGT TCTCCAAAGC TGATCAAATG CTATCATTAA TGTATCTAAT 

1 ATTTTAAGAA 2700 

AGGGTAACAC TGTTGAGAGC CAAATAGATA CATGGCCCAG AGCAAGCTTA 
AGTTACTAAT 2760 

AACTCCTTTT TCAGCTCACC CCCTGCTGAA GGCATGAGTT TGAATCTCAG 
TTTTGCCATT 2 820 

TGCTGTGTAA TGTATGCAAT TATATTTAGC ATCATATTTC TC AC TTG AAA 
AATGAAAATA 2880 

ATACATTTAA TAC TTAAC AG GAG TG TC AG A AAGTATATTA GCACTTGGTA 
ATTTATAC AA 2 94 0 

TACAATATAA AAGTAAGAAA TTTTTATTTT ATTTATTTTT ATTTTATTTT 
CAGCAATAAG 3 00 0 
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AGCTAACTGC CATCAAGAGC CATCAGTATG TCTTCAAGCT GCATGCCCAG 
AAAGC TGGAT 3 060 

TGGTTTTCAA AGAAAGTGTT TCTATTTTTC TGATGACACC AAGAACTGGA 
C ATCAAGTCA 312 0 

GAGGTTTTGT GACTCACAAG ATGCTGATCT TGCTCAGGTT GAAAGCTTCC 
AGGAACTGGT 3180 

AAGAAAATAG TTCTGGCCAG AATCAAAGAT TCAGCCCTAC AAGGATATGT 
TTTCCTGTGA 3240 

AATTATC TAA GAGGTAGGTT TAG AC ATC TG C TTTTAC ATT GATTTTTTTT 
TTTTTTTTTT 33 00 

TTTTTTGCAT AACGAAAGAG TAACCTAGCA TGTATTATAT TTTACAGTGA 
ACCATCTAAA 33 60 

ATTACC TTAA TATTCGTGGC AGGAACAGGC CCAGAGGGCA AGCAAGCCAG 
AGCCTTCTTT 3420 

G AC TTGTGAG CCAGAATTGT GCAAATAAGG ATTAGAAAAG TATTGGTAGA 
AACCCAGTTT 3480 

TAAGTTTGTA TGAAGTTAGC AACATTGTTT CAAAATAAAT CAAACAAGGC 
CAAGAGCAGT 3540 

GGCACATGCC TGTAATCCCA GC AC TTTGGG AGGCCAAGGC GGGTGTATCA 
CTTGAGGTCA 3 60 0 

GGAGTTTGAG ATCAGCCTGG CCAACATGGT GAAACCCCAT CTCAACTAAA 
AAATACAAAA 3 660 

ATTAGC TGGG CATGGTGGCA TACGCCTGTA GTTCCAGCTA CTCAGGAGGC 
TGAGGCAGCA 372 0 
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GAATTGCTTG AACCTGGGAG GTGGAGGCCT ACAGTTAGCT GAAATCATGC 
TACTGTACTC 3780 



CAGCCTAACA GAGTGAGACT C TATC TC AAA AAAATAATAA AATAAAAACA 
5 ATAAGTCAAG 3 840 

CAAGAATGAT GTCATAGAGG TTGGTAGACT AAAAAGCTAC AGAAATCTGT 
TCCTCCACTG 3 900 

0 AGAAAACTAT TGAACTGTCA AAAACTGTCT GAAGTAACTA TTTTGGAATT 
CTCGAGTCTA 3 960 

GTTAAAC AC T GGAAGCATCA AGGGAAGAGT TTGATAAAGA GGATGATAAA 
TTTTGGTTAA 4 020 

5 

TGTTGGTGAA TTTCAGCCTT TCCACTCAAT AATAACTATT TTCCATACCC 
CATTATTGCA 4080 

GGGATCCATG GGAACTGCTG CCCATGTTCT TGTAATGAAT TCCTGCAGCC 
3 AGGGTGAACA 4140 

ATAAGCACCT TTTTGTCCAA ATGTCAGGGT TATTGCTGAT TTCTGCCTTT 
GAATGC TGAG 42 00 

GGGCAGACAC AGAAGTGGGC TATCATTGCA TCAGTCCTCA TCAGCTGAAG 
TGGCTTCCCA 42 60 

AGGATTTAAA TAAATAGTAT GTGTTTTTCC TCCCTTTAGG AAGCAGTCAT 
TTAAGACAAT 43 2 0 

TTTTATTAGA TAACTGGCTG ACAGCAGAGA TAACAGAACA GAGATTTCAA 
TGACCATGCA 43 80 

CAACAGAGAA TAAAAATAGT TGGGAAAAAA TCATGACCAA ATGACTCTGA 
GCCACAACAA 4440 
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CCAAGATTTG ACAATCCCTG AAGAGCAAAA TAATTAAGTT ACCAGAGTTA 
CCACAACATA 4500 

GTATTCATAA TGTCCAGTTC TCAAAAAAAA ATTACAAAAC ATGCAAAGAA 
AAGTATGGTT 4560 

CATTCACAGG AAGAAAAAGT AATCTGACAG AAACTATCCC TGAAGAGGCT 
CAG ATATTAA 4 62 0 

AAATATGAGT CAAAAATGTT AAATCAGCTG TCTTAAGTAT AACCAATGAG 
TTAAAGGAAA 4 680 

CTAGACAAAA AGCTAAAGGA AACCGAAAAC ATAATAAATG AACAAAATTA 
GAATATC AAT 474 0 

ATAAAGGTAG AAATTGTAAA AAAGAACCAA GCAAAAATTC CAGAGCTGAA 
AAGTACAATG 4800 

AC TG AAATTT AAAAATAATT TTAAAAACTC AATGAAGAAG TTCAACAGCA 
GATTTGAGAA 4860 

GTAAGAGATC AG AAAAC TTG AAAATAAGAT AATTGAAACA ATCCAGACTA 
AGAAAAACAA 492 0 

AGAAAAAGAA TGAAGATAAA TAAATTC TAA GGAACCTGTA GGACATCAGC 
AAACATACTA 4980 

ACATATGTAC TGTAGAAATC CAGGAAAGAG AAGAGAAAGA GAAGCAGAGA 
AATACACTTA 5040 

AAGAAATAAT GAACAAAACT TTCCAAAATC TGAGGAAATA CATAAATATA 
TACATCCAAG 5100 

AGGCTCAATG AACTCCAAAA GGGTAAACTT AAAGAGATCT ACATTGAGAC 
AAAATATAGT 5160 
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CAAGTTGACA AAATCCACAG AGAGAATTTT GAAAGCAGCC AGAATGAAGC 
AACTCATCAT 5220 

TTACATAAGA CCCTGAATAA AATTAATAGC TGATTTTCTC TGAGAAACCA 
5 TGGAGATCAG 5280 

AAGGTAGTGG AATGGCATAT TTAAATGTCT GAAAGAAAAA ATAAAAC TGC 
CAACCATGAA 5340 

10 TTCTATGTAT AGCAAAGTTG TCCTTCAAGA ATGAAGGAAA AAGTAACACA 
TTTTC AGATA 540 0 

ACCAATAATT AAGGGATTTT ATTACCAGTA GACATGTGCT ACAGAAAATG 
C TAAAGG AAA 5460 

15 

CCTTTTAGGC TG AAC TG AAA G TAC AC TAG A CAGCAATTCA GAGCCTCCAA 
AATAAAG AAT 552 0 

ATTCATAAAA GTAACAATAG AGGTAAATAT AAAACCCAGA ATTACTACAT 
2 0 GTGTCATATA 5580 

GTTTATAACT TCTCCTATTT ATAGCTTTCT ATATTTATAT TTATCTATAA 
CTTCATAGGC 5640 

25 AAATGAATAA AAATTATAAA TATGATAGTG GTCATATAAT GTATAAAGAT 
GCAATCTGTG 5700 

ACAGTCTTAT GAAGCAGGGA TGAAGACATA TAGGATCAAA ATGTTTGCAT 
AG TTATTG AA 57 60 

30 

GC TATGTTG A TATTATGAAA TTATATTGTT AC AAG TTTAA GATGCTAATT 
ATAATTCTC A 582 0 

AGGTAACCAC TAATAAAATT ACCAAAATTA TGCAGAAAAG GAAAAAAGAA 
35 AAACAATACA 5880 
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CTATAAAAAA CCAATTAAAT ACAAAAAAAG TCAGTAACAG ACAACTTGAG 
AAACAAAGAC 5940 

ATATAAGATA TAGAGAAAAC AAATGATTAA ATGGCAAAAG TAAATCTTGT 
5 TTTAGTAATC 6000 

ACATTAAATA GAAAAGGATG AAGCCATCCT ATTAAAGGGC TG AG AC TG AC 
AAGTTGGCTA 6060 

10 AAAACTAAAA TAAATTAAAA AGAAAAACAA GACTCATCTA CATGCTGTCT 
ATAAGAGACT 612 0 

TGCCTTAGAT ATAAGGACAC AAAGAAGTTG AAAGTAAAAG GACTGAAAAA 
GATATTCCAT 6180 

15 

ACAAACAGTA GTAACCAAGA TAGTGCCGAG TGGC TATATT TTTGTCAAAC 
AAAATAAACT 6240 

AAAGTAAAAT TTACAAGAGA AAAAGAAGGG CATTATGCAT TGACAAAAAT 
20 TTTGACATAG 63 00 

CCAAATAATT ATGTTATAAA ATATATGTAC TTAATAATAC AGCCTCAAAA 
TATATGAAGC 63 60 

25 AATAATTGCT ATAATTTAAG GGAGAAAAGA ACAGTTCTAT GAAAAGTTAG 
AGAATGAAAT 642 0 

ATTCCACTTT CAACATGAGA TTAAACAACT AGACATAAGA TCAATAAGGA 
AATAGAAAAT 6480 

30 

TTGAACAACA CTATAAACCA ATTATCCCTA ACAGGCATAT ACAGAAGAAT 
CTACCCAACA .-..6540 

AGAGCAGAAT ATTAATTCTT CTCAAATGCA CATGGAACAT TCTTAAACCA 
3 5 TATGTTAGGC 6600 
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CACAAAACAA GTGTTAGTAA GTGTGAAAAT TTGAAGTCAT AAAAAGTATC 
TTTTGCAATT 6660 

ACAATGGAAT GAAGCTAGAA ATCAATAACT AGAAAAACCA GAAAAGTCAC 
5 GCATATGTAG 672 0 - 

AAATTTAAAA ACCCGCTCTT CAACAGCCAT TGGTCAAAGA AGAAATCACA 
AGGGACATTA 67 80 

0 GAAAATACCT TGAGACAAAT GAAGTAAAAA TACAAATAGC ACGTTTATGG 
TATACACTGA 6840 

AC ATAGTTC T AAGAGGGAAA TTTATAGCTG TGAGCAGTTA ACTAAAAAAG 
AAGAAAGATC - 6 9 0 0 

TCAAATCCAT AGCCTAACTG TACACTGTAA GGAACTAAAA AAAGTAAAAC 
AAAAATAGAA 6960 

GTCATCTTTA TGATTTGAAA GAGTAAAAGA TTTACCTAAT AAGTCCCTAA 
ATTTACTAAT 702 0 

AATAAAGAAA ATTGTTTATA TATTTAATTG CGTTAAAATT CAGAACTTGT 
AATCATAAAA 7080 

AGGACAGTAC ACATTGACAA GGAAACACAG CAAAGGAAAC CAGCCTATGC 
TGCTGCTGTT 7140 

GTGAGGATAA TTTGGTACAC TTACATTAGT TTGGTGTCTT TTCTTTCTCT 
TTCTTTCTTT 72 00 

CTTTCTTTCT CTTTCTTTCG TTCGTTCGTT CGTTCGTTTC TTTTTGAGAC 
AGAATCTCAC 72 60 

TCTATTGCCC AGGCTGGAGT GCAGTGGCGT GATCTTGGCT CACTACAACT 
TTTGTCTCCC 732 0 1 
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AGGTTCAAAT GATTCTCATG CCTCAGCCTC CCAAATAGCT GGGATTACAG 
GTGCATGCCA 73 80 

TCACGCCCAG CTAATTTTTG TATTTTTTTT AATAGAGAGG GGGCTTCATC 
5 ATGTTGGCCA 7440 

AGCCTAGTCT CAAACTCTTG GCCTCAGGTG ATCCGCCTGC CTCGGCCTCC 
CAAAGTACTG 7500 

0 GGATTACAGG TGCTTGGCCT GGTGGTGTCA TTTC TTAAAG TTGACAAAAA 
GCATATCCTG 7560 

GGGCCTAAAA ATTCTATTCT AGGACAGGTG CCAAGAATGT CATAGTAGCA 
TAC ATTCC AA 762 0 

ACTTGATAAA ACCCTGGTGT CAACCGATAG TATAATAGAT AAATTGGAGA 
AGAGTCATAC 7680 

AAAGGAGTAC AATACAGAAA CAAAAGTAAC CAAATTATCA AC AATTTC TC 

1 TCAGTTTTAA 7740 

ATTATCTTCT TTTGATATGT ATGATAATAT AGCACACCTA TTCTGTATGT 
ATTACTAAAC 7800 

AATACAAAAT CAAAAGGAAG AAAATTATGA GTAGTTAAGA ATATAGCATA 
GCAGCAACAT 7 860 

TTCTGGGAGA GGATGGGTTA TGTTAGATTA ATGAATATCA TCTCTGTGTT 
TTCTGAAAGA 7 92 0 

ATTTCCTGTT GAGATATAAA GGCCCATCTG ATCACTGGAT TGGGCTGAGC 
AGAGAACAAG 7 980 

GCCAACCATG GAAATGGATA AATGGTACTG AATGGACAAG ACAGTAAGTT 
CTAAAAATCT 804 0 



«rkeVKeep\ 3p ecis\ST VINCENTS 03ceoclast.doc 30/06/99 



- 47 - 

GGCAGTAATA TTTGTATTTG AATTTACTTT GCATTAAATC TGAAGTGTTC 
TCTAGTTACA 8100 

TGCTTTAAAA AATTC TC ATT TTAAGGTTAG TCATGAAAGA AGATGGTGCC 
5 AAC TTGTATG 8160 

TTGCAAAGGT TTCACAAGTT CCTCGAATGA ATCCAAGACC TGTCATGGTG 
AGGTAGACTG 822 0 

0 AC TGTGAAC T TGGCTCCAGG CTTATCTATG TCATTTTCAA ACACTTTCAT 
TTTAAGCAAA 8280 

CCATACAATA TC TTTAAGTC TGTTCCTTAC CTCCACAACA AAATTAAATT 
GCACTTGTCC 8340 

TCC TGATTTC ACAGGGTTGA TGTGAGGAAC AGAGGTTTTG ATGTATCAGG 
GAAAGATTAT 8400 

GAGTGACAGC AATTATACCT ATTATTTAAA ATAAGACAAT AGTTTTAAAA 
» TTTTAAAATG 8460 

GGTAAAGTTT GGCACTAGAA AATTTAATCT CAATTGTATA TTTATAGGAT 
CTTCAGATTA 8520 

C TAAAAAG AT TTGAGATAAT GCTGGAAAAA TTGGATTCAC ACAATTTCAC 
TCAATGTTTG 8580 

TCTGAGAGAT GAGACAGTTT TGAAAAGCTA CTTTATTGTA ATACATTCAT 
C AATATTGGA 8640 

AATATAACTT TATTTAATAA AAAGAGCCCC AGACTGGACA TTGGCAGGTT 
TGAAATGAGT 87 0 0 

TTTTTC TC AT TAGCTTTTGA CCTTGGATGG GATGGTAAGT TTTAGAAATC 
AGAGAACATG 87 60 
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TACATTTATA CATTGTTGTA TCTACACTGC CTTGCACATT GTGACTGCTT 
C ATAAATATC 882 0 

TAGAATTAAC TTTTATTTCT TATTTTACAA TACGGAAGTA GTAAATTTTC 
5 TCTACCTAAT 8880 

TCTTAAAATG GTTTTTTGTT TGTTTGTATT TTTGAGAGAC AGGGTCTTAC 
TCTGTTACCC 8940 

10 AGGCTGGAGT GCAGTAGTAC CATCGTGGCT CACTGCAGCC TTGACTTCCC 
TGGCTCAAGT 9 000 

GAGCCTCCCA TCTCAGCCTC CTGAGTAGCT GGG AC TAC AG GTGTGTGCCA 
CCTTGCTTGG 9060 

15 

C TTTTTTTTT TTTTTTTTTT TTTTTTTCAG CGATGGGGTC TC AC TATGTT 
GCCTGGGCTG 912 0 

ATCTTGAACT CCTGAGCTCA AGCAATCCTC CCACCTCGGC CTCCCAAAAT 
2 0 GTTGGAATTA 9180 

CAGGTGTGAG CCACCATGCC TGGCCTCTCA AAATATTTTA AGGATCAAAT 
ATATTATTAA 9240 

2 5 CTAACCAGTT TTTGGAAACT GCTCATCACT TAAAGAAATG TAAAATATTA 

TATG ATTAAG 93 00 

GTCTAACAAG TTTCAACAAT TAGCAAATTA TATCATAGAT GATAGTGATT 
CCAATGAGCA 93 60 

30 

AAGAGGAAAA ATTTATAATC CAAATGCTGA CCTAAAATAT CTGTGCCAAG 
CCATCTAAAC 942 0 

TCAGCTAAAT AGCACTGCAG TTTCAGTACT AAAACCACCA GGGAAGTAGG 

3 5 AGGAATAAAA 9480 
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TCAAGCATGG TTTTTAGAAA TAGCTGCTGA GTCTTCAGTT ATTTAAGGAA 
GCAAAATATT 9540 

GGGAAACTGT GTAAAGAAAA CGTGTCAGAC TTCTCCCATC AGCCAGCTAA 
5 GGCTTTGGAT 9600 

GTACTTGAAA GAATATTATG C TTAC AG AC A TGAAATAGGT TTGATTCAGG 
AC TTTGC AGT 9660 

0 ATTCCTATAG TTGATTTATA ACATCTCCTG CTAAGCAAAG CCCACTGACT 
AATTAGTCAC 9720 

CACTACACAA GGAAAAACAG CATTATTTTT AGAGGC TGAA TTAATGTTAG 
TTTTCTCATT 9780 

5 

TTCTCATCTT CATTCTCTCT GCTGTTGAAG AAATGTTCAG TGGCCAACTG 
ATTCTGCTTC 9840 

TTCTCTTGCA GGTTTCCTAT CCTGGGAGCA GGAGAGTGTG CCTATTTGAA 
) TGACAAAGGT 9900 

GCCAGTAGTG CCAGGCACTA CACAGAGAGG AAGTGGATTT GTTCCAAATC 
AGATATACAT 9960 

GTCTAGATGT TACAGCAAAG CCCCAACTAA TCTTTAGAAG CATATTGGAA 
CTGATAACTC 1002 0 

CATTTTAAAA TGAGCAAAGA ATTTATTTC T TATACCAACA GGTATATGAA 
AATATGC TC A 10080 

ATATCACTAA TAACTGGGAA AATACAAATC AAAATCATAG TAAAATATTA 
CCTGTTTTCA 10140 

TGGTGCTAAT ATTACCTGTT CTCCCACTGC TAATGACATA CCCGAGACTG 
AGTAATTTAT 1020 0 

AAATAAAAGA GATTTAATTG A 10221 
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(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : NO 
10 (iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GAGTGTTGTC TGTCCACTTC C 21 

15 

(2) INFORMATION FOR SEQ ID NO : 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
20 ( C ) STRANDEDNESS : s ingl e 

( D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: cDNA 
( iii ) HYPOTHETICAL : NO 
(iv) ANTI -SENSE: YES 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TTTCCAACTC CAATCCAGTT T 21 



3 0 (2) INFORMATION FOR SEQ ID NO : 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGGAGCTGA GTTTCCACTA C 21 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGT AGG GAA GCC TTT GTG AC -3 ' . 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acid 

(B) TYPE: amino acid 

(C) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

H-Cys-Met-Arg-Thr-Glu-Ala-Gln-Leu-Ala-Arg-Phe-Asp-Asn-Gln- 
Asp-Glu-Leu-Asn-Phe-OH 

(2) INFORMATION FOR SEQ ID NO : 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE.- nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(.xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCCACGCGTT TGTCAGCAAC AAAGACAGAA CAG 3 3 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GCCACGCGTG GGACCATAGG GGAAAAAGTA G 31 



(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 29: 



ACAATGGTTC TTGCCAGCTC TACCACCAGC ATCCACACCA TGCTGCTCCT 
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GCTCCTGATG 60 

CTCTTCCACC TGGGACTCCA AGCTTCAATC TCGGCGCGCC AGGACTACAA 
GGACGACGAT 12 0 

GACAAGACGC GTTTGTCAGC AACAAAGACA GAACAGATCC CAGTCAACAA 
GACCTATGCT 180 

GCTTGCCCGC AAAAC TGG AT TGGAGTTGAA AATAAATGTT TTTATTTTTC 
TGAATACCCA 24 0 

AGTAACTGGA CATTCGCCCA GGCCTTCTGC ATGGCACAAG AGGCCCAACT 
AGCTCGGTTT 3 0 0 

GACAACCAGG ATGAGCTGAA TTTCCTAATG AGATACAAGG CGAATTTTGA 
TTCCTGGATT 3 60 

GGCCTGCACA GAGAGTCGTC AGAGCACCCT TGGAAGTGGA CAGACAACAC 
TGAGTATAAC 42 0 

AACACGATTC CCATCCGGGG AGAGGAAAGA TTTGCCTACC TGAACAACAA 
CGGGATCAGC 480 

AGTACCAGGA TCTATTCACT TCGGATGTGG ATCTGTAGCA AGCTCAACAG 
CTATAGCCTC 540 

CACTGCCAAA CTCCTTTTTT TCCTTCCTAG CATTTACCAA GAGACGCTTT 
TTAGCCTGTT 60 0 

ATCTGTGGGT GCTACTTTTT CCCCTATGGT CCC 

660 



(2) INFORMATION FOR SEQ ID NO: 30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHETICAL : NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

10 

GCCACGCGTT CAGTAAAAAA GACAGCCAAG 3 0 



(2) INFORMATION FOR SEQ ID NO : 31: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
2 0 (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
2 5 GCCCAGCGTA ACTACAGGCA CTGTGAGG 2 8 
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